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OXIDATION BY HYDROGEN PEROXIDE 

Field of the Invention - 

The invention relates to a method of carrying out an oxidation reaction. 

5 

Background of the Invention 

> 

Monooxygenase enzymes catalyse the oxidation of a very wide range of 
substrates. In order to catalyse the reaction, a monooxygenase enzyme generally 
requires a cofactor and at least one electron-transfer partner protein (reductase). 
10 However, monooxygenase enzymes are capable of using hydrogen peroxide (H2O2) 
as an oxidizing agent because it acts as a source of dioxygen and two electrons. The 
use of H2O2 to drive oxidation reactions is known as the "peroxide shunt". 

Summary of the invention 
15 Monooxygenase enzymes generally have a high K m for H202, (such as about 

20mM) in comparison to peroxidase enzymes. As a result, high concentrations of 
H2O2 are required for appreciable levels of activity of a monooxygenase enzyme 
when the oxidation reaction is performed using the peroxide shunt. For example, the 

* a 

initial rate of monooxygenase activity using 50mM H2O2 is far below that when the 
20 natural co-factor, NAD(P)H, is used as with the physiological electron-transfer 
partners. 

The invention provides a more efficient method of carrying out an oxidation 
reaction using the peroxide shunt by reducing the oxidative damage that occurs to the 
monooxygenase enzyme by not allowing excess levels of H2O2 to be present whilst 
25 the reaction is carried out. 

Simultaneous production of H2O2 at a rate less than or equal to the rate at 
which it is used in an oxidation reaction catalysed by monooxygenase results in 
improved efficiency of the oxidation reaction and an increase in the product yield. 
Various methods may be used to produce H2O2 at the required rate, such as use of an 
30 electrochemical reaction, an enzyme or a precursor. 

Accordingly, the present invention provides a method of carrying oxxt an 
oxidation reaction catalysed by a monooxygenase enzyme and using hydrogen 
peroxide as an oxidant, in which reaction a low level of oxidation damage of the 
* monooxygenase occurs, said method comprising producing the hydrogen peroxide 
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simultaneously with the oxidation reaction, wherein the hydrogen peroxide is 
produced at a rate less than or equal to the rate at which it is used in the reaction. 

The present invention also provides a method of carrying out an oxidation 
reaction catalysed by a monooxygenase enzyme and using hydrogen peroxide as an 
5 oxidant, in which reaction a low level of oxidation damage of the monooxygenase 
occurs, said method comprising carrying out the reaction in the presence of an H2O2 
or hydroxyl radical sequestering agent that controls the H 2 0 2 or hydroxyl radical 
concentration. 

10 Description of the Sequences 

SEQ ID NO: 1 shows the nucleotide sequence of cytochrome P450Cam from 
Pseudomonas putida. 

SEQ ID NO: 2 shows the amino acid sequence of cytochrome P450Cam from 
Pseudomonas putida. 

15 SEQ ID NO: 3 shows the nucleotide sequence of cytochrome P450BM-3 

from Bacillus megaterium. 

SEQ ID NO: 4 shows the amino acid sequence of cytochrome P450 BM-3 
from Bacillus megaterium. The first 472 amino acid residues form the heme domain. 
The last 585 amino acid residues form the reductase domain. All 1048 amino acid 
20 residues form the holoenzyme. 

The convention in the art, which is adopted herein, is to refer to a mutant with 
reference to the native amino acid residue at a position in the sequence, followed by 
the amino acid at that position in the mutant, e. g., F87 refers to the phenylalanine at 
position 87 in the wild-type sequence, and F87A refers to the phenylalanine at 
25 position 87 in the wild-type sequence which has been changed to alanine in the 

variant. The numbering of the amino acid residues starts with the amino acid residue 
following the initial methionine residue. 

Mutants used in Examples were F87A (single mutation; SEQ ID NOs: 5 and 
6) and F87V L188Q A74G (triple mutation; SEQ ID NOs: 7 and 8). 
30 SEQ ID NO: 5 shows the amino acid sequence of the F87A mutant of 

cytochrome P450BM-3 from Bacillus megaterium. 

SEQ ID NO: 6 shows the nucleotide sequence of of the F87A mutant of 
cytochrome P450BM-3 from Bacillus me gat er mm. 

SEQ ID NO: 7 shows the amino acid sequence of the F87V L188Q A74G 



mutant of cytochrome P450BM-3 from Bacillus megaterium. 

SEQ ID NO: 8 shows the nucleotide sequence of of the F87V L188Q A74G 
mutant of cytochrome P450BM-3 from Bacillus megaterium. 

SEQ ID NO: 9 shows the nucleotide sequence of subunit 1 of B-276 alkene 
epoxidase from Nocardia coralline. 

SEQ ID NO: 10 shows the amino acid sequence of subunit 1 of B-276 alkene 
epoxidase from Nocardia coralline. 

SEQ ID NO: 1 1 shows the nucleotide sequence of subunit 2 of B-276 alkene 
epoxidase from Nocardia coralline. 

SEQ ID NO: 12 shows the amino acid sequence of subunit 2 of B-276 alkene 
epoxidase from Nocardia coralline. 

SEQ ID NO: 13 shows the nucleotide sequence of the alpha subunit of Py2 
alkene monooxygenase from Xanthobacta sp. 

SEQ ID NO: 14 shows the amino acid sequence of the alpha subunit of Py2 
alkene monooxygenase from Xanthobacta sp. 

SEQ ID NO: 15 shows the nucleotide sequence of the beta subunit of Py2 
alkene monooxygenase from Xanthobacta sp. 

SEQ ID NO: 16 shows the amino acid sequence of the beta subunit of Py2 
alkene monooxygenase from Xanthobacta sp. 

SEQ ID NO: 17 shows the nucleotide sequence of the gamma subunit of Py2 
alkene monooxygenase from Xanthobacta sp. 

SEQ ID NO: 1 8 shows the amino acid sequence of the gamma subunit of Py2 
alkene monooxygenase from Xanthobacta sp. 

SEQ ID NO: 19 shows the nucleotide sequence of the alpha subunit of 
soluble methane monooxygenase from Methylococcus capsulatas. 

SEQ ID NO: 20 shows the amino acid sequence of the alpha subunit of 
soluble methane monooxygenase from Methylococcus capsulatas. 

SEQ ID NO: 21 shows the nucleotide sequence of the beta subunit of soluble 
methane monooxygenase from Methylococcus capsulatas. 

SEQ ID NO: 22 shows the amino acid sequence of the beta subunit of soluble 
methane monooxygenase from Methylococcus capsulatas. 

SEQ ID NO: 23 shows the nucleotide sequence of the gamma subunit of 
soluble methane monooxygenase from Methylococcus capsulatas. 

SEQ ID NO: 24 shows the amino acid sequence of the gamma subunit of 
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soluble methane monooxygenase from Methylococcus capsulatas. 

SEQ ID NO: 25 shows the nucleotide sequence of GPol alkane hydroxylase 
(AlkB gene) from Pseudomonas oleovorans. 

SEQ ID NO: 26 shows the amino acid sequence of GPol alkane hydroxylase 
5 from Pseudomonas oleovorans, 

SEQ ID NO: 27 shows the nucleotide sequence of the alpha subunit of 
toluene 2-monooxygenase from Burkholderia cepacia, 

SEQ ID NO: 28 shows the amino acid sequence of the alpha subunit of 
toluene 2-monooxygenase from Burkholderia cepacia, 
10 SEQ ID NO: 29 shows the nucleotide sequence of the beta subunit of toluene 

2-monooxygenase from Burkholderia cepacia. 

SEQ ID NO: 30 shows the amino acid sequence of the beta subunit of toluene 
2-monooxygenase from Burkholderia cepacia. 

SEQ ID NO: 3 1 shows the nucleotide sequence of the gamma subunit of 
15 toluene 2-monooxygenase from Burkholderia cepacia. 

SEQ ID NO: 32 shows the amino acid sequence of the gamma subunit of 
toluene 2-monooxygenase from Burkholderia cepacia. 

* * 

SEQ ID NO: 33 shows the nucleotide sequence of phenol hydroxylase (pheA) 
gene from Bacillus stearothermophilus. 
20 SEQ ID NO: 34 shows the amino acid sequence of phenol hydroxylase gene 

from Bacillus stearothermophilus. 

SEQ ID NO: 35 shows the nucleotide sequence of stearoyl-ACP desaturase 
from Helianthus annuus. 

SEQ ID NO: 36 shows the amino acid sequence of stearoyl-ACP desaturase 
25 from Helianthus annuus. 

Detailed description of the Invention 

It is to be understood that this invention is not limited to particular 
embodiments. It is also to be understood that different applications of the disclosed 
30 methods may be tailored to the specific needs in the art. It is also to be understood 
that the terminology used herein is for the purpose of describing particular 
embodiments of the invention only, and is not intended to be limiting. 

In addition as used in this specification and the appended claims, the 
singular forms "a", "an'\ and "the" include plural referents unless the content clearly 



dictates otherwise. Thus, for example, reference to "a substrate" includes two or 
more substrates, reference to "an enzyme" includes reference to two or more 
enzymes, and the like. . 

All publications, patents and patent applications cited herein, whether supra 
or infra, are hereby incorporated by reference in their entirety. 

The methods of the invention enable the oxidation of a variety of substrates. 
Such substrates include, but are not limited to, alkanes, aromatic compounds, 
terpenoid compounds, alkenes and fatty acids. 

Suitable alkanes include, but are not limited to, methane, ethane, propane, 
butane, pentane, hexane, heptane, rc-octane, «-nonane, »-decane, n-dodecane and n- 
hexadecane. The oxidation of alkanes produces alcohols. The oxidation of methane 
to methanol is technologically and economically very important. The medium-chain 
alcohols (e.g. «-octanol) are synthetic intermediates while the longer chain alcohols 
(e.g. rc-dodecanol) are used for the synthesis of fatty acid derivatives. 

Suitable aromatic compounds include, but are not limited to, benzene, 
toluene, xylene, chlorobenzene, phenol and substituents thereof. The phenolic and 
catecholic products are used in the synthesis of fragrance and flavour compounds. 

Suitable terpenoid compounds include, but are not limited to, monoterpenes 
stich as limonene, pinene, terpinene, and ocimene, sesquiterpenes such as valencene 
and aromadendrene and triterpenes which include the steroidal. compounds. The 
products are intermediates for synthesis, fine fragrance and flavouring chemicals and 
pharmaceuticals. 

Suitable alkenes inchxde, but are not limited to, simple molecules such as 
propene, hex-l-ene, hex-2-ene, and styrene, and carbon-carbon double bonds in 
complex molecules. Selective epoxidation of alkenes to a single enantiomer is very 
important in synthesis. Optically pure propene oxide and styrene oxide are very 
useful intermediates in synthesis. 

Hydroxylated fatty acids are precursors to polymers. 

Monooxygenase enzyme 

The enzyme used to carry out an oxidation reaction according to the 
invention is a monooxygenase enzyme. A person . skilled in the art can determine 
whether an enzyme is a monooxygenase enzyme using standard techniques in the art. 
Typically, the prosthetic groups may be characterised using protein crystallography, 



especially for non-heme iron enzymes because they generally do not have 
chromophores. Otherwise, a person skilled in the art will typically use sequence 
alignment, looking for conserved motifs such as the active site, and iron content as 
well as subunit composition. 

The monooxygenase enzyme preferably has a K m for H2O2 of at least 1 5nM, 
at least 20nM, at least 25nM, at least 30nM, at least 35nM, at least 40nM, at least 
45nM or at least 50nM. 

Examples of monooxygenase enzymes include, but are not limited to, 
cytochrome P450 monooxygenases and non-heme di-iron monooxygenase enzymes. 
Suitable non-heme di-iron monooxygenase enzymes include, but are not limited to 
methane monooxygenase (Colby et aL, Biochem. J. ? 1977; 165: 395-402; Dalton, 
Adv. Appl. Microbiol., 1980; 26: 71-87; Fox etaL, J. Biol. Chem., 1989; 264: 
10023-10033; Fox et aL, Methods Enzymol., 1990; 188: 191-202; McDonald et aL, 
Appl. Environ. Microbiol., 1997; 63: 1898-1904), alkane hydroxylase (vanBeilen et 
aL, Enzyme Microb. TechnoL, 1994; 16: 904-911), toluene monooxygenase (Luykx 
et aL, Biochem. Biophys. Res. Commun., 2003; 312: 373-379; Pikus et aL, 
Biochemistry, 1996; 35: 9106-9119; Newman & Wackett, Biochemistry, 1995; 34: 
14066-14076), alkene monooxygenase (Gallagher et aL, Eur. J. Biochem., 1997; 
247: 635-641; Lange & Que, Curr. Opin. Chem. Biol., 1998; 2: 159-172; Zhou et aL, 
FEBS Lett., 1998; 430: 181-185), phenol monooxygenase (Divari et aL, Eur. J. 
Biochem., 2003; 270: 2244-2253) and steroid desaturase (Shanklin et aL, 
Biochemistry, 1994; 33: 12787-12794). The non-heme di-iron monooxygenase 
enzymes are typically of eukaryotic or prokaryotic origin and preferably of bacterial, 
fungal, yeast, plant or animal origin. Preferred sequences are shown in SEQ ID NOs: 
1 to 36. 

The enzyme used in the methods of the invention is preferably a cytochrome 
P450 enzyme, typically of eukaryotic or prokaryotic origin. Cytochrome P450 
monooxygenases are typically characterised by a 446-450 nm heme Soret band for 
the ferrous-carbon monoxide complex. The enzyme is generally of bacterial, fungal, 
yeast, plant or animal origin, and thus may be from a bacterium of the genus 
Psendomonas. The enzyme may be a naturally-occurring form of P450, such as 
P450 cam? P450bm-3 from Bacillus megaterhim, P450 terp from Pseudomonas sp, 
P450 ery F from Saccharopollyspora erythraea and also P450 105 Dl (CYP105) from 
Streptomyces griseus strains. 
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Alternatively, the enzyme may be a mutant of a naturally-occurring form of 
P450. The mutants retain the essential biological activity of the naturally-occurring 
enzyme, namely the ability to catalyse an oxidation reaction using H 2 0 2 . The mutant 
may have one or more mutations in the active site of the enzyme. 
5 An amino acid 'in the active site' is one which lines or defines the site in 

which the substrate is bound during catalysis or one which lines or defines a site 
through which the substrate must pass before reaching the catalytic site. Therefore 
such an amino acid typically interacts with the substrate during entry to the catalytic 
site or during catalysis. Such an interaction typically occurs through an electrostatic 

1 0 interaction (between charged or polar groups), hydrophobic interaction, hydrogen 
bonding or van der Waals forces. 

The amino acids in the active site can be identified by routine methods to . 
those skilled in the art. These methods include labelling studies in which the enzyme 
is allowed to bind a substrate which modifies ('labels') amino acids which contact 

15 the substrate. Alternatively the crystal structure of the enzyme with bound substrate 
can be obtained in order to deduce the amino acids in the active site. 

The monooxygenase enzyme may have 1, 2, 3, 4, 5 to 10, 10 to 20 or more 
other mutations, such as substitutions, insertions or deletions. Amino acid 
substitutions may be made to the amino acid sequence of a naturally-occurring 

20 enzyme, for example from 1, 2, 3, 4 or 5 to 10, 20 or 30 substitutions. Conservative 
substitutions may be made, for example, according to Table 1 . Amino acids in the 
same block in the second column and preferably in the same line in the third column 
may be substituted* for each other: 

25 Table 1 - Conservative amino acid substitutions 
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The mutations may be in the active site or outside the active site. Typically 
the mutations are in the 'second sphere' residues which affect or contact the position 
or orientation of one or more of the amino acids in the active site. The insertion is 
typically at the N and/or C terminal and thus the enzyme may be part of a chimeric 
protein. The deletion typically comprises the deletion of amino acids which are not 
involved in catalysis, such as those outside the active site (thus the enzyme is a 
mutated fragment of a naturally occurring enzyme). The monooxygenase enzyme 
may thus comprise only those amino acids which are required for oxidation activity. 

The mutation in the active site typically alters the position and/or 
conformation of the substrate when it is bound in the active site. The mutation may 
make the site on the substrate which is to be oxidized more accessible to the heme 
group. Thus the mutation may be a substitution to an amino acid which has a smaller 
or larger, or more or less polar, side chain. 

The mutations typically increase the stability of the protein, or make it 
easier to purify the protein. They typically prevent the dimerisation of the protein, 
typically by removing cysteine residues from the protein (e.g. by substitution of 
cysteine at position 334 of P450 cam , or at an equivalent position in a homologue, 
preferably to alanine). They typically allow the protein to be prepared in soluble 
form, for example by the introduction of deletions or a poly-histidine tag, or by 
mutation of the N-terminal membrane anchoring sequence. The mutations typically 
inhibit protein oligomerisation, such as oligomerisation arising from contacts 
between hydrophobic patches on protein siirfaces. 

The mutations may affect the manner in which the enzyme utilises H 2 0 2 and 
thereby improve the efficiency of the reaction. For example, mutants of the P450 
enzyme from Psendornonas putida hydroxylate napthalene through the "peroxide 
shunt" with more than a 20-fold increase in the activity of the enzyme (Joo et aL, 
Nature, 1999; 399(6737): 636-637). In addition, replacement of all the methionine 
residues of the heme domain of P450 B m-3 with norleucine results in a two-fold 
increase in the peroxygenase activity of the enzyme (Cirino et al , Biotechnpl. 
Bioeng., 2003; 83(6): 729-734). Furthermore, direct evolution studies to find mutants 
of enzymes more resistant to peroxide (Cirino & Arnold, Angew. Chem. Int. Ed., 

2003;42:3299-3301). 

Thus the mutant enzyme is typically at least 70% homologous to a naturally 

occurring enzyme on the basis of amino acid identity. 



A mutant protein (i.e. described as being a mutant of another protein) 
mentioned herein is typically at least 70% homologous to the relevant protein or at 
least 80 or 90% and more preferably at least 95%, 97% or 99% homologous thereto 
over at least 20, preferably at least 30 5 for instance at least 40, 60 or 100 or more 
contiguous amino acids. The contiguous amino acids may include the active site. 
This homology may alternatively be measured not over contiguous amino acids but 
over only the amino acids in the active site. 

Homology can be measured using known methods. For example the 
UWGCG Package provides the BESTFIT program which can be used to calculate 
homology (for example used on its default settings) (Devereux et al (1984) Nucleic 
Acids Research 12, p3 87-395). The PILEUP and BLAST algorithms can be used to 
calculate homology or line up sequences (typically on their default settings), for 
example as described in Altschul S. F. (1993) J Mol Evol 36:290-300; Altschul, S, F 
et al (1990) J Mol Biol 215:403-10. 

Software for performing BLAST analyses is publicly available through the 
National Center for Biotechnology Information {http://www.ncbi.nlm.nih.gov/). 
This algorithm involves first identifying high scoring sequence pair (HSPs) by 
identifying short words of length W in the query sequence that either match or satisfy 
some positive-valued threshold score T when aligned with a word of the same length 
in a database sequence. T is referred to as the neighbourhood word score threshold 
(Altschul et al, supra). These initial neighbourhood word hits act as seeds for 
initiating searches to find HSPs containing them. The word hits are extended in both 
directions along each sequence for as far as the cumulative alignment score can be 
increased. Extensions for the word hits in each direction are halted when: the 
cumulative alignment score falls off by the quantity X from its maximum achieved 
value; the cumulative score goes to zero or below, due to the accumulation of one or 
more negative-scoring residue alignments; or the end of either sequence is reached. 
The BLAST algorithm parameters W, T and X determine the sensitivity and speed of 
the alignment. The BLAST program uses as defaults a word length (W) of 1 1, the 
BLOSUM62 scoring matrix (see Henikoff and Henikoff (1992) Proc. Natl Acad. 
Set USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=4, 

» 

and a comparison of both strands. 

The BLAST algorithm performs a statistical analysis of the similarity 
between two sequences; see e.g., Karlin and Altschul (1993) Proc. Natl. Acad. Set 
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USA 90: 5873-5787. One measure of similarity provided by the BLAST algorithm 
is the smallest sum probability (P(N)) 5 which provides an indication of the 
probability by which a match between two nucleotide or amino acid sequences 
would occur by chance. For example, a sequence is considered similar to another 
sequence if the smallest sum probability in comparison of the first sequence to the 
second seqiience is less than about 1, preferably less than about 0.1, more preferably 
less than about 0.0 1, and most preferably less than about 0.001. 

Mutants include fragments of the above-mentioned sequences. Such 
fragments retain monooxygenase activity. Fragments may be at least 300, at least 
400 or at least 450 amino acids in length. Such fragments may be used to produce 
chimeric enzymes as described in more detail below. 

Mutants also include chimeric proteins comprising fragments or portions of a 
naturally-occurring enzyme. One or more amino acids may be alternatively or 
additionally added to the polypeptides described above. An extension may be 
provided at the N-terminus or C-terminus of the naturally-occurring enzyme or 
variant or fragment thereof. The extension may be quite short, for example from 1 to 
1 0 amino acids in length. Alternatively, the extension may be longer. A carrier 
protein may be fused to an amino acid sequence described above. A fusion protein 
incorporating one of the enzymes described above can thus be used in the invention. 

The naturally-occurring enzyme or mutant thereof may also be chemically- 
modified. A number of side chain modifications are known in the art and may be 
made to the side chains of the enzymes discussed above. Such modifications 
include, for example, glycosylation, phosphorylation, modifications of amino acids 
by reductive alkylation by reaction with an aldehyde followed by reduction with 
NaBH 4 , amidination with methylacetimidate or acylation with acetic anhydride. The 
modification is preferably glycosylation. 

The mutations discussed herein are generally introduced into the enzyme by 
using methods known in the art, such as site directed mutagenesis of the enzyme, 
PCR and gene shuffling methods or by the use of multiple mutagenic 
oligonucleotides in cycles of site-directed mutagenesis. Thus the mutations may be 
introduced in a directed or random manner. The mutagenesis method thus produces 
one or more polynucleotides encoding one or more different mutants. Typically a 
library of mutant oligonucleotides is produced which can be used to produce a library 
of mutant enzymes. 
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The enzyme may be made synthetically or by recombinant means using 
methods known in the art. The amino acid sequence of the monooxygenase enzyme 
may be modified to include non-naturally occurring amino acids or to increase the 
stability of the enzyme. When the enzyme is produced by synthetic means, such 
amino acids may be introduced during production. The proteins or peptides may also 
be modified following either synthetic or recombinant production. 

The enzyme may also be produced using D-amino acids. In such cases the 
amino acids will be linked in reverse sequence in the C to N orientation. This is 
conventional in the art for producing such proteins or peptides. 

The enzyme may be produced in a cell by in situ expression of the 
polypeptide from a recombinant expression vector. The expression vector optionally 
carries an inducible promoter to control the expression of the polypeptide. The 
enzyme may be produced in large scale following purification by any protein liquid 
chromatography system after recombinant expression. Preferred protein liquid 
chromatography systems include FPLC, AKTA systems, the Bio-Cad system, the 
Bio-Rad BioLogic system and the Gilson HPLC system. 

Oxidation reaction 

The methods of the invention concerns carrying out a high efficiency 
oxidation reaction catalysed by a monooxygenase enzyme. A high efficiency 
oxidation reaction is a reaction that occurs without an appreciable reduction in the 

4. 

enzyme turnover or product yield or inactivation of the monooxygenase enzyme. 
Preferably, the monooxygenase enzyme displays at least 70%, at least 80%, at least 
90%, at least 95% or 100% of the activity shown at the beginning of the reaction 
after 1 hour, 2 hours, 6 hours, 12 hours, 1 day, 2 days or 5 days. 

Typically the methods of the invention are carried out in vitro, such as in a 
cell free system. 

The reaction is driven by the "peroxide shunt". The reaction of the invention 
is carried out in the presence of the monooxygenase enzyme (a), the substrate (b) and 
H2O2 (c). The reaction is typically performed in aerobic conditions and does not 
require any cofactors. The production of (c) is discussed in more detail below. In this 
system the flow of electrons is typically: (c) — > (a) — ► (b). 

In the methods the concentration of (a) and (b) is typically from 10" 8 to 10" 
2 M, preferably from 10" 6 to 10" 4 M. Typically the ratio of concentrations of (a): (b) is 
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from 0.1:10 to 1:10, preferably from 1:0.5 to 1:2, or from 1:0.8 to 1:1.2. Preferably, 
the concentration of (b) is greater than the concentration of (a). The preferred 
concentration of (a) is that which when reacted with substrate will generate sufficient 
product to be detected by available analytical methods e.g. GC 5 HPLC. This is 
- typically of the order of jaM quantities. 

Generally the methods are carried out at a temperature and/or pH at which 
the monooxygenase enzyme is functional, such as when the enzyme has at least 20%, 
50%, 80% or more of peak activity. Typically the pH is from 2 to 1 1, such as from 5 
to 9 or from 6 to 8, preferably from 7 to 7.8 or 7.4. The pH can be maintained using 
a suitable buffering agent such as phosphate or acetate based systems. Typically the 
temperature is from 0 to 80°C, such as from 25 to 75 °C, from 30 to 60°C or from 
50°C to 80°C. Preferably, the temperature is from 20 to 40°C. 

Typically in the methods at least 20 turnovers/min occur, such as at least 50, 
100, 200, 300, 500 or more turnovers (turnover is measured as nanomoles of product 
formed per nanomole of enzyme). 

Typically, the rate of H 2 0 2 production is less than or equal to 1, 2 or 3 jag 
per min per mg of monooxygenase enzyme. Typically, the concentration of H2O2 
throughout the reaction is less than or equal to 0.1, 0.5 or ImM. Typically, the 
reaction continues for at least 60 minutes, at least 240 minutes, at least 6 hours or at 
least 10 hours. 

The methods of the invention may be carried out in the monooxygenase 
substrate if it is a liquid under the reaction conditions. The methods of the invention 
may also be conducted in a solvent. Suitable solvents include, but are not limited to, 
water, aqueous buffer solutions mixed water/organic and aqueous buffer/organic 
solvent systems. Preferably, the organic solvent is a hydrocarbon such as hexane, 
benzene, acetonitrile, lower aliphatic alcohols, ketones and dioxane, 
dimethylformamide and dimethylsulphoxide and mixtures thereof. The solvent is 
typically one in which the reagants and products are highly soluble and one that 
maintains the stability and activity of the monooxygenase enzyme. 

The reaction may be carried out in a homogenous system with all the 
components in solution. Typically, the monooxygenase enzyme and substrate are 
mixed together in a suitable solvent in a stirred tank reactor and the reaction is 
conducted in batch, semi-batch or continuous mode. 



Alternatively, the raonooxygenase enzyme may be immobilized on a 
suitable solid support, such as silica, prior to carrying out the method of the 
invention. An immobilized monooxygenase enzyme can be packed into a fixed bed 
reactor and the substrate passed over the enzyme. In one embodiment, the enzyme 
5 producing the H2O2 (discussed in more detail below) may be immobilized on the 
same or different material as the monooxygenase enzyme. Procedures for 
immobilizing enzymes are known in the art. Examples of such procedures include, 
but are not limited to, covalent coupling to insoluble organic or inorganic supports, 
entrapment in gels and adsorption to ion exchange resins or other adsorbent 

10 materials. (G. F. Bickerstaff ed., "Immobilization of Enzymes and Cells," Humana 
Press, Totowa. New Jersey, 1997). 

In a further embodiment, a membrane on the "entry" side admits the 
substrate slowly from the "reactant" side and then a hydrophilic membrane on the 
"exit" side allows hydrophilic compounds to flow out to the "product" side of the 

15 flow reaction cell. In this case the H2O2 may be generated outside the membrane and 
allowed to flow through the membrane to the mobile or immobile enzyme. 

In one embodiment, H2O2 is preferably produced by one of the methods 
discussed in more detail below. In another embodiment, a H 2 0 2 or hydroxyl radical 
sequestering agent is used to sequester excess H2O2 or hydroxyl radical during the 

20 oxidation reaction. The sequestering agent may be a chelating agent. In one 

embodiment, the chelating agent is EDTA. The EDTA inhibits production of the 
hydroxyl radical, for example, produced by the reaction of trace amounts of iron (or 
copper) with the H2O2. 

25 H2O2 production by an electrochemical reaction 

The H2O2 may be produced in the method of the invention by an 
electrochemical reaction. An electrochemical reaction is generally a means for 
introducing a current to a liquid, preferably a solution. An electrochemical reaction is 
typically an oxidation or reduction reaction that takes place at an electrode through 

30 which a current flows. An electrode is a solid capable of conducting electricity, 

typically carbon-based or metallic, leading to an external source or sink which is in 
contact with the liquid, preferably a solution. The electrode may be either positively 
charged (cathode) or negatively charged (anode). Two or more electrodes may form 
an electrochemical cell from which an external wire can lead from each electrode to 
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an external electrical device. An oxidation or reduction reaction takes place at one 
electrode, while a redox reaction can take place either in an electrochemical cell or 
directly in the liquid. 

Production of H2O2 using an electrochemical reaction is energy efficient. 
5 H2O2 is typically produced by the controlled electrochemical reduction of molecular 
oxygen to hydrogen peroxide. The surface area and the overpotential of the cathode 
are key considerations for the two-electron reduction of molecular oxygen to 
hydrogen peroxide. Typically, carbon-based cathodes are used and they may be 
modified with a compound known to lower the overpotential for this reaction. 

10 Electrode materials and modifiers which will perform this task effectively and 

efficiently are well known in the art. The reduction of O2, and hence production of 
hydrogen peroxide, can typically be controlled by the potential applied to the 
cathode. The potential applied to the cathode will vary depending on the cathode and 
any modifications to the cathode made. 

15 The electrochemical reaction used in the method of the invention may be the 

sonoelectrochemical reduction of dioxygen. This method is well known in the art 
(Compton et aL, Electroanalysis, 1997; 9(7): 509-522). 

H2O2 production by an enzyme 

20 The H2O2 may be produced in the method of the invention by an enzyme. 

The enzyme is preferably an oxidase. Examples of suitable oxidases include, but are 
not limited to, glucose oxidase (E.G. 1.1.3.4), secondary-alcohol oxidase (E.G. 
1.1.3.18), methanol oxidase (E.G. 1.1.3.31), oxalate oxidase (E.C. 1.2.3.4), aryl- 
aldehyde oxidase (E.C. 1.2.3.9), carbon monoxide oxidase (E.G. 1.2.3.10), amine 

25 oxidase (E.C. 1.4.3.4), ethanolamine oxidase (E.C. 1.4.3.8), nitroethane oxidase 
(E.C. 1.7.3.1) and sulfite oxidase (E.C. 1.8.3.1). Glucose oxidase (E.C. 1.1.3.4) 
catalyzes the conversion of D-glucose to D-glucono-l,5-lactone and H2O2. 
Secondary-alcohol oxidase (E.C. 1.1.3.18) catalyzes the conversion of a secondary 
alcohol to a ketone and H2O2. Methanol oxidase (E.C. 1.1.3.31) catalyzes the 

30 conversion of methanol to formaldehyde and H2O2. Oxalate oxidase (E.C. 1.2.3.4) 
catalyzes the conversion of oxalate to carbon dioxide and H2O2. Aryl-aldehyde 
oxidase (E.C. 1.2.3.9) catalyzes the conversion of an aromatic aldehyde to an 
aromatic acid and H2O2. Carbon monoxide oxidase (E.C, 1.2.3,10) catalyzes the 
conversion of CO and H2O to carbon dioxide and H2O2. Amine oxidase (E.C. 
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1.4.3.4) catalyzes the conversion of RCH 2 NH 2 and H 2 0 to RCHO andNH 3 and 
H2O2. Ethanolamine oxidase (E.G. 1.4.3.8) catalyzes the conversion of ethanolamine 
and H2O to glycolaldehyde and H2O2. Nitroethane oxidase (E.G. 1.7.3.1) catalyzes 
the conversion of nitroethane and H 2 0 to acetaldehyde and H 2 0 2 . Sulfite oxidase 
(E.C. 1.8.3.1) catalyzes the conversion of sulfite and H2O2 to sulfate and H2O2. The 
oxidase may be purchased commercially (e.g., glucose oxidase). Alterantively, the 
oxidase can be extracted from known microorganisms using procedures known in the 
art. 

The substrate for the oxidase will be well known in the art. In addition to the 
substrate, the reaction to produce H2O2 will also involve water. Typically, a H2O2- 
activating metal is also included in the reaction. Suitable metals inqlude, but are not 
limited to, cerium, chromium, cobalt, copper, iron, manganese, molybdenum, silver, 
titanium, tungsten, vanadium and mixtures thereof. Metallosilicates containing the 
above metals can be prepared and. used in the method of the invention. The procedure 
for producing such metallosilicates in known in the art (Neumann et al, Journal of 
Catalysis, 1997; 166: 206-127). The metallosilicate is preferably tetrahedrally 
coordinated titanium such as silicalite-1 (TS-1), silicalite-2 (TS-2), zeolite-beta, 
silicon analogs of ZSM-48 and MCM-4 1 . (Murugavel and Roesky, Angew. Chem. • 
Int. Ed. Engl., 1997; 36(5): 477-479). 

In a preferred embodiment of the invention, the metal-containing solid or 
metallosilicate is used as a support upon which the HkCVproducing enzyme is 
immobilized. In another preferred embodiment, the monooxygenase enzyme is also 
immobilized on the same or different metallosilicate.support. 

Preferably, the oxidase is first mixed with the other reaction components 
and then the reaction is initiated by addition of the oxidase substrate. For example, 
the monooxygenase enzyme, monoxygenase enzyme substrate and oxidase are all 
mixed and then the oxidase enzyme is added. In a preferred embodiment, P450bm3> 
octane and glucose oxidase are mixed together and then glucose added. Control of 
H 2 0 2 generation can typically be accomplished by controlling the rate at which the 

* 

oxidase substrate is added. 

H0O2 production by a precursor 

The H2O2 may be produced in the method of the invention by a precursor. 
The generation of H2O2 by the addition of a precursor to water is well known in the 
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art. Precursors include, but are not limited to, salts of perborate, salts of 
percarbonate, salts of perphosphate and peroxynitrite. Preferred precursors are 
sodium salts. The H202-producing properties of the precursor may be enhanced by 
using a compound such as tetraacety lethylenediamine . The amount of precursor 
added to the solution containing the monoxygenase enzyme and substrate is such to 
maximise the enzymatic reaction with the substrate and to minimise the deactivation 
of the enzyme by H2O2. Preferably the concentration of H2O2 produced does not 
exceed the K m value for the enzyme but is sufficient to generate the enzyme reactive 
species. 

Examples 
Example J 

In this experiment, octane was reacted with electrochemically generated 
H2O2 in the presence of P450bm3 heme domain. The experiment was performed at 
room temperature with a three-electrode configuration in a 1 00 mL glass beaker. The 
reticulated vitreous carbon (RVC) cathode, platinum gauze anode and Ag/AgCl 
reference electrode were contained in the one vessel. The RVC cathode was briefly 
immersed in a 1 mM 2-aminoanthraquinone ethanolic solution before being removed 
and allowed to dry in air. The reaction solution contained aqueous Tris buffer (50 
mM, pH 7.4) saturated with oxygen, 0-.2 M KC1, 0.5 mM octane, and 3 \xM P450 B M3 
F87V L188Q A74G heme domain. The reaction solution was stirred to equilibrate 
(5-10 minutes) and then a potential of -0.55 V vs Ag/AgCl was applied for 2 hours 
and the solution stirred continuously throughout. GC analysis revealed the presence 
of the solvent chloroform, octane, 2-, 3- and 4-octanol and the internal standard 1- 
nonanol. The relative proportion of 2, 3 & 4-octanol was 1:1.1:0.7. The total 
concentration of octanols formed was 141 jliM, representing a turnover per enzyme 
of 47. 

A similar experiment was performed with 1.43 jjM wild-type P450bm3 heme 
domain. The total concentration of octanols formed was 8.4 pM,. representing a 
turnover per enzyme of 6. The relative proportion of 2, 3 & 4-octanol in this case 
was 1:1.7:2.0. 
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Example 2 

In this experiment, octane was reacted with enzymatically generated H 2 0 2 
in the presence of P450 BM3 holoenzyme. Into -a glass vial was added a solution (total 
volume 5 mL) consisting of aqueous Tris buffer (50 mM, pH 7.4), 0.5 mM octane, 
1.6 uM P450 B M3 F87V L188Q A74G holoenzyme and glucose oxidase (1.5 U). 
After equilibration (5 mins), the reaction was initiated by addition of glucose (1x10" 
6 moles). Successive additions of glucose (1 x 10" 6 moles) were made every 5 
minutes up to 1 hour (total of 12 additions equivalent to 1.2 x 10" 5 moles). The 
reaction was stirred continuously during this time and stopped after 1.5 hours. GC 
analysis revealed the presence of the solvent chloroform, octane, 2-, 3- and 4-octanol 
and the internal standard 1-nonanol. The relative proportion of 2, 3 & 4-octanol was 
1:1.1:0.8. The total concentration of octanols formed was 17 uM, representing a 
turnover per enzyme of 1 0. 

Example 3 

In this experiment, octane was reacted with H 2 0 2 derived from sodium 
perborate, in the presence of P450 BM 3 holoenzyme. Into a glass vial was added a 
solution (total volume 5 mL) consisting of aqueous Tris buffer (40 mM, pH 7.4), 0.5 
mM octane, and 1.3 uM P450 BM3 F87V L188Q A74G holoenzyme. After 
equilibration (5 mins), the reaction was initiated by addition of NaB0 3 .4H 2 0 (1x10" 
4 moles) and stirred continuously for 1 hour. GC analysis revealed the presence of the 
solvent chloroform, octane, 2-, 3- and 4-octanol and the internal standard 1-nonanol. 
The relative proportion of 2, 3 & 4-octanol was 1:1.8:1.1. The total concentration of 
octanols formed was 77 uM, representing a turnover per enzyme of 59. 

For Examples 1 to 3, no octanol products were observed when the P450 
enzyme was absent from the solution. 

Example 4 

In this experiment, pinene was reacted with H 2 0 2 derived from sodium 
perborate, in the presence of P450 BM 3 heme domain. Into a glass, vial was added a 
solution (total volume 5 mL) consisting of aqueous Tris buffer (40 mM, pH 7.4), 
0.63 mM pinene, and 3.7 uM wild-type P450 BM 3 heme domain. After equilibration (5 

w 

mins), the reaction was initiated by addition of 7.8 x 10" 6 moles NaBQ 3 .4H 2 0 and 
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stirred continuously for 1 hour. GC analysis revealed the presence of cis/trans 2,3- 
epoxides (32%), (+)-/row-verbenol (16%), (+)-cw-verbenol (6%), (+)- 
verbenone/(+)-myrtenol (13%), myrtenal (4%), as well as unidentified further 
oxidation products (29%). The total concentration of products formed was 80 uM, 
representing a turnover per enzyme of 22. 

Example 5 

In this experiment, phenol monooxygenase is reacted with phenol in the 
presence of with H 2 0 2 generated by sodium perborate. Into a glass vial is added a 
solution (total volume 5 mL) consisting of aqueous Tris buffer (40 mM, pH 7.4), 
0.63 mM phenol, and 3.7 uM wild-type phenol monooxygenase. After equilibration 
(5 mins), the reaction is initiated by addition of 7.8 x 10" 6 moles NaB0 3 .4H 2 0 and 
stirred continuously for 1 hour. GC analysis reveals the presence of oxidation 
products. 

Example 6 

In this experiment, P450 BM3 is reacted with palmitic acid in the presence of 
H 2 0 2 generated by glucose oxidase. Into a glass vial is added a solution (total volume 
5 mL) consisting of aqueous Tris buffer (50 mM, pH 7.4), 0.5 mM palmitic acid, 1.6 
uM P450bm3 holoenzyme and glucose oxidase (1.5 U). After equilibration (5 mins), 
the reaction is initiated by addition of glucose (1 x 10" 6 moles). Successive additions 
of glucose (1 x 10" 6 moles) are made every 5 minutes up to 1 hour (total of 12 
additions equivalent to 1 .2 x 10" 5 moles). The reaction is stirred continuously during 
this time and stopped after 1.5 hours. GC analysis reveals the presence of oxidation 
products. 

Example 7 

Plant CYP74C is reacted with 13 S-hydroperoxylinolenic acid to form the 
compound 3Z-hexenal (a fragrance). The H 2 0 2 is generated by sodium perborate. 
Into a glass vial is added a solution (total volume 5 mL) consisting of aqueous Tris 
buffer (40 mM, pH 7.4), 0.63 mM 13 S-hydroperoxylinolenic acid, and 3.7 uM wild- 
type plant CYP74C. After equilibration (5 mins), the reaction is initiated by addition 
of 7.8 x 10" 6 moles NaB0 3 .4H 2 0 and stirred continuously for 1 hour. GC analysis 



reveals the presence of oxidation products. 



20 



SEQUENCE LISTING 



<110> ISIS INNOVATION LIMITED 

<120> HYDROGEN PEROXIDE OXIDATION 

<130> N. 91079 SA 

<160> 36 

<170> Patentln version 3.2 

<210> 1 

<211> 1248 

<212> DNA 

<213> Pseuciomonas put i da 



<220> 
<221> 
<222> 



CDS 

(1) . . (1248) 



ata acq act gaa acc ata caa age aac gec aat ctt gec cat ctg cca 

Me? Thr Tta Glu Tar He Gin Ser Asn Ala Asn Leu Ala Pro Leu Pro 
1 5 10 

cec cat atg cca gag cac ctg gta ttc gac ttc gac atg tac aat ccg 

Pro His va? Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro 

20 25 J 

tra aat ctg tct gec ggc gtg cag gag gec tgg gca gtt ctg caa gaa 
teg aat cx.g u^l. yy a i a = =» ^ L Gln Glu 

Ser Asn Leu Ser Ala Gly Val Gin Glu Ala Trp AJ.a 

35 40 45 

tea aac gta ccg gat ctg gtg tgg act ege tgc aac ggc gga cac tgg 
Ser Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp 
50 55 • 60 

acc act cgc ggc caa ctg ate cgt gag gec tat gaa gat tac cgc 
He Ala Thr Arg Gly Gin Leu He Arg Glu Ala Tyr Glu Asp Tyr Arg 
65 70 75 

cac ttt tec age gag tgc ccg ttc ate ect cgt gaa gee ggc gaa gec 
His Phe Ser Ser Glu Cys Pro Phe He Pro Arg Glu Ala Gly Glu Ala 

85 9° 9b 

tac gac ttc att cec ace teg atg gat ccg ecc gag cag cgc cag ttt 
Tyr Asp Phe He Pro Thr Ser Met Asp Pro Pro Glu Gin Arg Gin Phe 

100 

™f nra eta acc aac caa gtg gtt ggc atg ccg gtg gtg gat aag ctg 
S Ala Leu 111 Asn Gin U Val Gly Met Pro Val Val Asp Lys Leu 
115 120 I 25 

gag aac egg ate cag gag ctg gec tgc teg ctg ate gag age ctg cgc 
Glu Asa Arg He Gin Glu Leu Ala Cys Ser Leu lie Glu Ser Leu Arg 
130 135 140 

ccg caa gga cag tgc aac ttc acc gag gac tac gee gaa cec ttc ccg 
Pro Gin Gly G.ln Cys Asn Phe Thr Glu Asp Tyr Ala Glu Pro Phe Pro 
145 I 50 15 

ata cgc ate ttc atg ctg etc gca ggt eta ccg gaa gaa gat ate ccg 
lie Arg He Phe Met Leu Leu Ala Gly Leu Pro Glu Glu Asp lie Pro 

165 I 70 

cac ttg aaa tac eta acg gat cag atg acc cgt ccg gat ggc age atg 
Hds Leu Lys Tyr Leu Thr Asp Gin Met Thr Arg Pro Asp Gly Ser Met 

180 185 

acc ttc gca gag gee aag gag gcg etc tac gac tat ctg ata ccg ate 
Thr Phe Ala Glu La Lys Glu Ala Leu Tyr Asp Tyr Leu He Pro He 
195 200 205 

ate gag caa cgc agg cag aag ccg gga acc gac get ate age ate gtt 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



21 



lie Glu Gin Arg Arg Gin Lys Pro Gly Thr Asp Ala lie Ser He Val 

215 220 

gcc aac ggc cag gtc aat ggg cga ccg ate acc agt gac gaa acc aaa 
Ala Asn Gly Gin Val Asn Gly Arg Pro He Thr Ser Lp Glu 111 Lys 

230 " 235 240 

tra till clt r? C r^ 9 r"* Ctg gtC 9gC ggc ctg gat ac 9 fltc aat 

Arg Met Cys Gly Leu Leu Leu Val Gly Gly Leu Asp Thr Val Val Asn 

245 250 255 

" C ? tC agc " c agc at 9 ^g ttc otg gcc aaa age ccg gag cat cgc 
Phe Leu Ser Phe Ser Met Glu Phe Leu Ala Lys Ser Pro Glu His Arg 



265 270 



Gin rfn t 9 rV ?? 9 ° 9 CC ° gSg Cgt att cca gcc ? ct tgc gag gaa 
Gin Glu Leu He Glu Arg Pro Glu Arg lie Pro Ala Ila Cys Ilu Ilu 

280 285 " 

tI? »" * 9 ° " C tCg Ct9 gtt gcc gat <3? c ^ ate etc acc tec 
Leu Leu Arg Arg Phe Ser Leu Val Ala Asp Gly Arg He Leu Thr Ser 

ZSU 295 300 

f at Hf 9 t ^ t Cat ggc gtg caa at< 3 aa S aaa ggt gac cag ate eta 

Asp Tyr Glu Phe His Gly Val Gin Leu Lys Lys Gly Lp Gin lie Leu 

315 320 

eta ccg cag atg ctg tct ggc ctg gat gag cgc gaa aac gcc tgc cca 
Leu Pro Gin Met Leu Ser Gly Leu .Asp Glu Arg Glu Asn Ila Cys Pro 

325 330 w 335 

Se? His Val Ph^ * g ° °" ** g gtt tCa CaC acc * cc «:t ggc 

Met H 1S Val Asp Phe Ser Arg Gin Lys Val Ser His Thr Thr Phe Gly 

340 345 35 0 

cac ggc age cat ctg tgc ctt ggc. cag cac ctg gcc cgc egg gaa ate 
Hxs Gly Ser His Leu Cys Leu Gly Gin His Leu La Arg A^g llu lie 
355 360 365 

He Val T^r LeS ^ I" T Ctg aC ° agg att CCt gac ttc tcc •« 

lie Val Thr Leu Lys Glu Trp Leu Thr Arg He Pro Asp Phe Ser He 

J/u 375 380 

gcc ccg ggt gcc cag att cag cac aag agc ggc ate gtc age aac ate 
Ala Pro Gly Ala Gin He Gin His Lys sL Sly He Val Ser Ily^al 

390 395 400 

rfn In ! i CtC S Ct Ctg gtC tgg gat ccg ^ act acc gcg gta taa 

Gin Ala Leu Pro Leu Val Trp Asp Pro Ala Thr Thr Lys Ala Val 

405 410 " 415 



<210> 2 

<211> 415 

<212> PRT 

<213> Pseudomonas putida 

<400> 2 



Met Thr Thr Glu Thr He Gin Ser Asn Ala Asn Leu Ala Pro Leu Pro 
15 10 15 

Pro His Val Pro Glu His Leu Val Phe Asp Phe Asp Met Tyr Asn Pro 

20 25 3 0 

Ser Asn Leu Ser Ala Gly Val Gin Glu Ala Trp Ala Val Leu Gin Glu 
J5 40 45 

Ser Asn Val Pro Asp Leu Val Trp Thr Arg Cys Asn Gly Gly His Trp 
su 55 60 

lie Ala Thr Arg Gly Gin Leu He Arg Glu Ala Tyr Glu Asp Tyr Arg 

70 75 SO 

His Phe Ser Ser Glu Cys Pro Phe He Pro Arg Glu Ala Gly Glu Ala 

85 90 



720 



768 



816 



8 64 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



95 



22 



Tyr Asp Phe lie Pro Thr Ser Met Asp Pro Pro Glu Gin Arg Gin Phe 

100 105 110 

Arg Ala Leu Ala Asn Gin Val Val Gly Met Pro Val Val Asp Lys Leu 
115 120 125 

Glu Asn Arg lie Gin Glu Leu Ala Cys Ser Leu He Glu Ser Leu Arg 
130 135 " 140 

Pro Gin Gly Gin Cys Asn Phe Thr Glu Asp Tyr Ala Glu Pro Phe Pro 
145 150 155 160- 

Ile Arg He Phe Met Leu Leu Ala Gly Leu Pro Glu Glu Asp He Pro 

165 170 175 

His Leu Lys Tyr Leu Thr Asp Gin Met Thr Arg Pro Asp Gly Ser Met 

180 ^ 185 190 

Thr Phe Ala Glu Ala Lys Glu Ala Leu Tyr Asp Tyr Leu lie Pro He 
195 200 205 

He Glu Gin Arg Arg Gin Lys Pro Gly Thr Asp Ala He Ser He Val 
210 215 " 220 

Ala Asn Gly Gin Val Asn Gly Arg Pro He Thr Ser Asp Glu Ala Lys 
225 230 235 240 

Arg Met Cys Gly Leu Leu Leu Val Gly Gly Leu Asp Thr Val Val Asn 

245 250 " 255 

Phe Leu Ser Phe Ser Met Glu Phe Leu Ala Lys Ser Pro Glu His Arg 

260 265 270 

Gin Glu Leu He Glu Arg Pro Glu Arg He Pro Ala Ala Cys Glu Glu 
275 280 285 ' 

Leu Leu Arg Arg Phe Ser Leu Val Ala Asp Gly Arg He Leu Thr Ser 
290 295 300 

Asp Tyr Glu Phe His Gly Val Gin Leu Lys Lys Gly Asp Gin He Leu 
305 310 315 320 

Leu Pro Gin Met Leu Ser Gly Leu Asp Glu Arg Glu Asn Ala Cys Pro 

325 330 335 

Met His Val Asp Phe Ser Arg Gin Lys Val Ser His Thr Thr Phe Gly 

340 345 350 

His Gly Ser His Leu Cys Leu Gly Gin His Leu Ala Arg Arg Glu He 
355 360 365 

He Val Thr Leu Lys Glu Trp Leu Thr Arg He Pro Asp Phe Ser He 
370 375 380* 

Ala Pro Gly Ala Gin He Gin His Lys Ser Gly He Val Ser Gly Val 
385 390 395 400 

Gin Ala Leu Pro Leu Val Trp Asp Pro Ala Thr Thr Lys Ala Val 

405 410 415 



<210> 3 

<211> 4957 

<212> DNA 

<213> Bacillus inegaterium 



<220> 
<221> CDS 

<222> (1541) . . (4690) 
<400> 3 

agatctttat gaagacatag ctgcagaaga aaaagcaaga gctacatatc aatggttaat 60 
tgatatatca gatgatcccg atttaaacga cagcttacga tttttacgag aaagagagat 120 
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tgttcactca 


cagcggttcc 


gcgaggccgt 


ggagatttta aaagatgaca 


gagacaggaa 


180 


gaaaatcttt 


taactagtaa 


aaaaacatcc 


cccttggcga atgcaaacga 


aaggagggat 


240 


gttttttgtt 


gtgactgcgt tgattatgcg ctagaactgc agtgacaaga 


aacaaccttt 


300 


aatttccctt 


caacatcttt 


ccaaactcgc 


gtataactgt attcacctcc 


aatagattca 


360 


ccggttgcca 


gtgccccatt 


taacgctact 


tttgtaacgg taacggcaag 


ttcttgaaac 


420 


agtttaactt 


cttgttccaa 


cacttccatg 


cccgctatat caagactttt 


tgaacgatga 


480 


acatttatat 


cttcttcttt 


tgacaaccat 


tgcccaaggt gattcacaaa 


aataagctca 


540 


tctgaaagta 


attcttctaa 


tagctctatg 


ttattagaaa gcatggctga 


gcgaagcatt 


600 


tcttcgtatt 


ctataactct 


tgcttgattc 


atttttaatc ctcctttacg 


ccttgtgtaa 


660 


ctcttttcta 


tttccacgtt 


gcttttcctt 


taaacttctt tcattaataa 


ttcgtgctaa 


720 


attatgttaa 


tagaggggat 


aagtggacta 


attttctgta agcactaaat 


attctgaaat 


780 


actctgttaa 


ttacctttaa 


atggtataaa 


attagaatga aagaaccttt 


tctttccact 


840 


tttctagtta 


tctttttact 


attaagatgc 


agttttttat acttgtaatt 


gtagcggaat 


900 


gaacgttcat 


tccgtttttg 


aaaagaggtg ataaagtgga atctactcca 


acaaaacaaa 


960 


aagcgatttt 


ttctgcttcg 


cttctgctgt 


ttgcagaaag agggtttgat 


gcaaccacga 


1020 


tgccaatgat 


tgcagagaat 


gccaaagtag 


gagcaggaac aatttatcgc 


tactttaaaa 


1080 


ataaagaaag 


ccttgtaaat 


gaattattcc 


aacagcacgt aaacgagttt 


ttacagtgca 


1140 


ttgaaagcgg tctggcaaac 


gagagagatg 


gataccgaga tgggtttcat 


catatctttg 


1200 


aaggtatggt 


gacatttact 


aaaaaccatc 


ctcgtgctct tggatttatt 


aaaactcata 


12 60 


gccaaggaac 


ttttttaaca 


gaagagagcc 


gcttagcata tcaaaagctg 


gtggaatttg 


1320 


tttgtacgtt 


cttcagagaa 


ggacaaaagc 


aaggtgtgat tagaaatctt 


cctgaaaatg 


1380 


cgctaattgc 


tattttattt 


ggaagtttca 


tggaagtata tgaaatgatt 


gaaaatgact 


1440 


acttatcttt 


aactgatgaa 


cttcttaccg 


gtgtagaaga gagtctgtgg 


gcagcactta 


1500 


gcagacaatc 


atgaaactta 


acaagtgaaa 


gagggataac atg aca att 


aaa gaa 


1555 



Met Thr lie Lys Glu 
1 5 

atg cct cag cca aaa acg ttt gga gag ctt aaa aat tta ccg tta tta 1603 
Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn Leu Pro Leu Leu 

!0 15 20 

aac aca gat aaa ccg gtt caa get ttg atg aaa att gcg gat gaa tta 1651 
Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys He Ala Asp Glu Leu 

25 30 35 

gga gaa ate ttt aaa ttc gag gcg cct ggt cgt gta acg cgc tac tta 1699 
Gly Glu He Phe Lys Phe Glu Ala Pro Gly Arg Val Thr Arg Tyr Leu 
40 45 50 

tea agt cag cgt eta att aaa gaa gca tgc gat gaa tea cgc ttt gat 1747 
Ser Ser Gin Arg Leu He Lys Glu Ala Cys Asp Glu Ser Arg Phe Asp 
55 60 65 

aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat ttt gca gga gac - 1795 
Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg Asp Phe Ala Gly Asp 
70 75 80 85 

ggg tta ttt aca age tgg acg cat gaa aaa aat tgg aaa aaa .gcg cat 1843 
Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn Trp Lys Lys Ala His 

90 95 100 

aat ate tta ctt cca age ttc agt cag cag gca atg aaa ggc tat cat 18 91 



24 



Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala Met Lys Gly Tyr His 

105 HO 1" 

acq atg atg gtc gat ate gec gtg cag ctt gtt caa aag tgg gag cgt 

Ala net Me? Val Lp He Ala Val Gin Leu Val Gin Lys Trp Glu Arg 
120 125 130 



eta aat gca gat gag cat att gaa gta ccg gaa .gac atg aca cgt tta 
Leu Asn Ala Lp Ilu His He Glu Val Pro Glu Asp Met Thr Arg Leu 
135 14° 145 

acg ctt gat aca att ggt ctt tgc ggc ttt aac tat cgc ttt aac age 
Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn Tyr Arg Phe Asn Ser 
150 155 I 60 



ttt tac cqa gat cag cct cat cca ttt att aca agt atg gtc cgt gca 
P^e Tyr tig Asp Gin Pro His Pro Phe He Thr Ser Met Val Arg Ala 

170 l" 75 ■ LOU 

ctq gat gaa gca atg aac aag ctg cag cga gca aat cca gac gac cca 
Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala Asn Pro Asp Asp Pro 

185 19° 195 

act tat gat gaa aac aag cgc cag ttt caa gaa gat ate aag gtg atg 
Ala Tyr Lp Glu Asn Lys Arg Gin Phe Gin Glu Asp lie Lys Val Met 

' 200 205 . 210 

aac gac' eta gta gat aaa att att gca gat cgc aaa gca age ggt gaa 
Asn Asp Leu. Val Asp Lys He He Ala Asp Arg Lys Ala Ser Gly Glu 
215 220 225 

eaa aac aat gat tta tta acg cat atg eta aac gga aaa gat cca gaa 
Gin sir £ Lp Leu Leu Th? His Met Leu Asn Gly Lys Asp Pro Glu 
230 235 240 

acg ggt gag ccg ctt gat gac gag aac att cgc tat caa att att aca 
Th? lly Glu Pro Leu Asp Asp Glu Asn lie Arg Tyr Gin He lie Thr 

ttc tta att geg gga eae gaa aca aca agt ggt ctt tta tea ttt geg 
Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu Leu Ser Phe Ala 

265 270 27b 

ctg tat ttc tta gtg aaa aat cca cat gta tta caa aaa gca gca gaa 
Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin Lys Ala Ala Glu 



280 285 



gaa gca gca cga gtt eta gta gat cct gtt cca age tac aaa caa gtc 
Ilu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser Tyr Lys Gin Val 
295 300 305 

aaa caa ctt aaa tat gtc ggc atg gtc tta aac gaa geg ctg cgc tta 
Lys Gin Leu Lys Tyr Val lly Met Val Leu Asn Glu Ala Leu Arg Leu 
310 3 15 320 



taa cca act get cct geg ttt tec eta tat gca aaa gaa gat acg gtg 
£rp Pro Thr Ala Pro 111 Phe Ser Leu Tyr Ala Lys Glu Asp Thr Val 

330 335 

ctt aaa gga gaa tat cct tta gaa aaa ggc gac gaa eta atg gtt ctg 
Leu Gly lly Glu Tyr Pro Leu Glu Lys Gly Asp Glu Leu Met Val Leu 
y y 345 350 35 5- 

*i-f cct caa ctt cac cgt gat aaa aca att tgg gga gac gat gtg gaa 
ill Pro Gin Leu His Arg Lp Lys Thr He Trp Gly Asp Asp Val Glu 
360 565 3/0 

gag ttc cgt cca gag cgt ttt gaa aat cca agt geg att ccg cag cat 
Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala lie Pro Gin His 
375 380 385 

geg ttt aaa ccg ttt gga aac ggt cag cgt geg tgt ate ggt cag cag 
111 Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala Cys He Gly Gin Gin 
3 9 

ttc get ctt cat gaa gca acg ctg gta ctt ggt atg atg eta aaa cac 



1939 



1987 



2035 



2083 



2131 



2179 



2227 



2275 



2323 



2371 



2419 



2467 



2515 



2563 



2611 



2659 



2707 



2755 



2803 
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Phe Ala Leu His Glu Ala Thr Leu Val' Leu Gly Met Met Leu Lys His 

410 415 420 

ttt gac ttt gaa gat cat aca aac tac gag ctg gat att aaa gaa act 
Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp He Lys Glu Thr 

425 430 435 

tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca aaa teg aaa aaa 
Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala Lys Ser Lys Lys 
440 445 450 

att ccg ctt ggc ggt att cct tea cct age act gaa cag tct get aaa 
He Pro Leu Gly Gly He Pro Ser Pro Ser Thr Glu Gin Ser Ala Lys 
455 460 465 

aaa gta cgc aaa aag gca gaa aac get cat aat acg ccg ctg ctt gtg 
Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr Pro Leu Leu Val 
470 475 480 485 

eta tac ggt tea aat atg gga aca get gaa gga acg gcg cgt gat tta 
Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr Ala Arg Asp Leu 

490 495 500 

gca gat att gca atg age aaa gga ttt gca ccg cag gtc gca acg ctt 
Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro Gin Val Ala Thr Leu 

505 5io 515 

gat tea cac gee gga aat ctt ccg cgc gaa gga get gta tta att gta 
Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala Val Leu He Val 
520 525 * 530 

acg gcg tct tat aac ggt cat ccg cct gat aac gca aag caa ttt gtc 
Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala Lys Gin Phe Val 
535 540 545 

gac tgg tta gac caa gcg tct get gat gaa gta aaa ggc gtt cgc tac 
Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val Lys Gly Val Arg Tyr 
550 555 560 565 

tec gta ttt gga tgc ggc gat aaa aac tgg get act acg tat caa aaa 
Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr Thr Tyr Gin Lys 

570 575 580 

gtg cct get ttt ate gat gaa acg ctt gee get aaa ggg gca gaa aac 
Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala Lys Gly Ala Glu Asn 

585 590 595 

ate get gac cgc ggt gaa gca gat gca age gac gac ttt gaa ggc aca 
He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp Phe Glu Gly Thr 
600 . 605 610 

tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta gca gee tac ttt 
Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val Ala Ala Tyr Phe 
615 620 625 

aac etc gac att gaa aac agt gaa gat aat aaa tct act ctt tea ctt 
Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys Ser Thr Leu Ser Leu 
630 635 640 645 

caa ttt gtc gac age gec gcg gat atg ccg ctt gcg aaa atg cac ggt 
Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala Lys Met His Gly 

650 655 " 660 

gcg ttt tea acg aac gtc gta gca age aaa gaa ctt caa cag cca ggc 
Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu Gin Gin Pro Gly 

665 670 675 

agt gca cga age acg cga cat ctt gaa att gaa ctt cca aaa gaa get 
Ser Ala Arg Ser Thr Arg His Leu Glu He Glu Leu Pro Lys Glu Ala 
680 685 690 

tct tat caa gaa gga gat cat tta ggt gtt att cct cgc aac tat gaa 
Ser Tyr Gin Glu Gly Asp His Leu Gly Val He Pro Arg Asn Tyr Glu 
695 700 705 

gga ata gta aac cgt gta aca gca agg ttc ggc eta gat gca tea cag 
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Gly lie val Asn Arg Val Thr Ala Arg Phe Gly Leu Asp Ala Ser Gin 
710 715 720 725 



caa ate cat ctg gaa gca gaa gaa gaa aaa tta get cat ttg cca etc 
Gin lie Arg Leu Glu La Glu Glu Glu Lys Leu Ala His Leu Pro Leu 

730 735 

act aaa aca gta tec gta gaa gag ctt ctg caa tac gtg gag ctt caa 
Ha Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr Val Glu Leu Gin 

750 755 



745 



gat ect gtt acg cgc acg cag ctt cgc gca atg get get aaa acg gtc 
Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala Ala Lys Thr Val 
750 765 770 

tac cca ccg cat aaa gta gag ctt gaa gee ttg ctt gaa aag caa gee 
Cvs Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu Glu Lys Gin Ala 
775 780 785 

tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg ctt gaa ctg ctt 
Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met Leu Glu Leu Leu 

gaa aaa tac ccg gcg tgt gaa atg aaa ttc age gaa ttt ate gaa ctt 
Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu Phe He Ala Leu 

810 815 820 

eta cca age ata cgc ccg cgc tat tac teg att tct tea tea ect cgt 
Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser He Ser Ser Ser Pro Arg 

825 830 835 



ate aat gaa aaa caa gca age ate acg gtc age gtt gtc tea gga gaa 
Val Asp Glu Lys Gin Ala Ser He Thr Val Ser Val Val Ser Gly Glu 
840 845 850 

cca tag age gga tat gga gaa tat aaa gga att gcg teg aac tat ctt 
111 Trp sL Gly Tyr Gly Glu Tyr Lys Gly He Ala Ser Asn Tyr Leu 
855 860 865 

gec gag ctg caa gaa gga gat acg att acg tgc ttt att tec aca ccg 
Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys Phe He Ser Thr Pro 
870 875 880 885 

caa tea aaa ttt acg ctg cca aaa gac ect gaa acg ccg ctt ate atg 
Gin Ser llu Phe Thr Leu Pro Lys Asp Pro Glu Thr Pro Leu lie Met 

890 895 900 

gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc ttt gtg cag gcg 
Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly Phe Val Gin Ala 

905 910 91b 

cgc aaa cag eta aaa gaa caa gga cag tea ctt gga gaa gca cat tta 
Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly Glu Ala Hxs Leu 

- ' ^ ' 9 2 S 



920 



tac ttc ggc tgc cgt tea ect cat gaa gac tat ctg tat caa gaa gag 
Tvr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu Tyr Gin Glu Glu 
935 " 940 945 



ctt aaa aac gec caa age gaa ggc ate att acg ctt cat acc get ttt 
Leu Tin Asn Ala Gin Ser Glu Gly He He Thr Leu His Thr Ala Phe 
950 955 960 965 

tct cac atg cca aat cag ccg aaa aca tac gtt cag cac gta atg gaa 
Ser Arg Me? Pro Asn Gin Pro Lys Thr Tyr Val Gin His Val Met Glu 

caa gac ggc aag aaa ttg att gaa ctt ctt gat caa gga gcg cac ttc 
Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp Gin Gly Ala His Phe 

985 990 99b 

tat att tgc ' gga gac gga age caa atg gca ect gec gtt gaa gca 
Tyr lie Cys Gly Asp Gly Ser Gin Met Ala Pro Ala Val Glu Ala 
1000 1005 1010 

acg ctt atg aaa age tat get gac gtt cac caa gtg agt gaa gca 



3763 



3811 



3859 



3907 



3955 



4003 



4051 



4099 



4147 



4195 



4243 



4291 



4339 



4387 



4435 



4483 



4531 



4576 



4621 
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Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val Ser Glu Ala 
1015 1020 1025- 

gac get cgc tta tgg ctg cag cag eta gaa gaa aaa ggc cga tac 4666 
Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys Gly Arg Tyr 
1030 1035 1040 

111 Lys 3 Kp £? Jg Hi HI ^^^^^ ^ Cta ^a aaagtagttt 4720 

1045 

agttggttga aggaagatcc gaacgatgaa tegtteggat ctttttattg gtagagtaaa 4780 

egtagattte atctatttag tgacttgtag cggttgattg gagggcaagg tgaagactcc 4840 

aatcaaccgc ggtgtcacat geaagecata cgaaattcat ttctcccatt tattegtett 4900 

ttgtccccac ttaattttta tagegectta aegtttctte tgcgtgacag cagatct ' 4957 

<210> 4 

<211> 1049 

<212> PRT 

<213> Bacillus megaterium 

<400> 4 

Met Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys 
1 5 10 is 

Asn Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys 
• 20 25 30 

lie Ala Asp Glu Leu Gly Glu lie Phe Lys Phe Glu Ala Pro Gly Arg 
35 40 45 *~ y 

* Val Thr Arg Tyr Leu Ser Ser Gin Arg Leu He Lys Glu Ala Cys Asp 
130 55 60 

Glu Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arcr 
65 70 75 80 

Asp Phe Ala Gly Asp Gly Leu Phe Thr Ser Trp Thr His Glu Lys Asn 

85 90 95 

Trp Lys Lys Ala His Asn He Leu Leu Pro Ser Phe Ser Gin Gin Ala 

100 105 no 

Met Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val 
115 120 125 

Gin Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu 
130 135 i 4 o ' 

Asp Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn 
±qb 150 155 i 6 o 

Tyr Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr 

165 170 175 

Ser Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala 

I 80 185 190 

Asn Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu 
195 200 205 

Asp lie Lys Val Met Asn Asp Leu Val Asp Lys He He Ala Asp Arg 
210 215 220 

Lys Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn 
225 . 230 235 240 

Gly Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arq 

245 250 255 
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Tyr Gin He He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly 

260 265 270 

Leu Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu 
275 280 285 

Gin Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro 
290 295 300 

Ser Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn 
305 310 315 

Glu Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala 

325 330 -"5 

Lys Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp 

340 345 350 

Glu Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp 
355 ■ 360 365 

Gly Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser 
370 375 380 

Ala He Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala 
385 390 395 

Cys He Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly 
y 405 410 415 

Met Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu 

420 425 430 

Asp He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys 
435 440 

Ala Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr 
450 455 460 

Glu Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn 
465 470 475 

Thr Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly 

485 490 ga3 

Thr Ala Arg Asp Leu Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro 

500 505 510 

Gin Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly 
515 520 525 

Ala Val Leu He Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn 
530 535 540 

Ala Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val 
545 550 555 

Lys Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala 

565 

Thr Thr Tyr Gin Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala 

580 585 

Lys Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp 
595 S00 605 

Asp Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp 
-- - 615 620 



610 



val Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys 

635 



625 



630 



Ser Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu 



645 650 655 



Ala Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu 
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660 

Leu Gin Gin Pro 
675 

Leu Pro Lys Glu 
690 

Pro Arg Asn Tyr 
705 

Leu Asp Ala Ser 



Ala His Leu Pro 

740 



Gly Ser Ala Arg 

680 

Ala Ser Tyr Gin 
695 

Glu Gly lie Val 
710 

Gin Gin lie Arg 
725 

Leu Ala Lys Thr 



665 

Ser Thr Arg His 



Glu Gly Asp His 

700 

Asn Arg Val Thr 
715 

Leu Glu Ala Glu 
730 

Val Ser Val Glu 
745 



670 

Leu Glu lie Glu 
685 

Leu Gly Val lie 



Ala Arg Phe Gly 

720 

Glu Glu Lys Leu 
735 

Glu Leu Leu Gin 
750 



Tyr Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met 
755 760 765 

Ala Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu 
770 775 ~ 780 

Leu Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr 
785 790 795 800 

Met Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser 

805 810 815 

Glu Phe He Ala Leu Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser He 

820 825 830 

Ser Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr Val Ser 
835 840 845 

Val Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He 
850 855 860 

Ala Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cvs 
865 87 ° 875 880 

Phe He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu 

885 890 895 

Thr Pro Leu He Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg 

900 go5 910 

Gly Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu 
915 920 925 

•Gly Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr 
930 935 94Q 

Leu Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr 
945 9 50 955 960 

Leu His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val 

965 970 ' 975 

Gin His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp 

980 985 990 

Gin Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro 
995 1000 1005 

Ala Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin 
1010 1015 " 1020 

Val Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu 
1025 1030 1035 

Lys Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 
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<210> 5 

<211> 3147 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Cytochrome P450BM-3 mutant 



<220> 

<221> CDS 

<222> (1)..(3147) 



aca°att 5 aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat 
Thr He Lys Glu Me? Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn 
! 5 10 15 

t-ta cca tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa att 
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys He 

20 25 . 3° 

aaa gat gaa tta gga gaa ate ttt aaa ttc gag gcg cct ggt cgt gta 
Ala Asp Glu Leu Gly Glu He Phe Lys Phe Glu Ala Pro Gly Arg Val 
35 40 45 

aca cac tac tta tea agt cag cgt eta att aaa gaa gca tgc gat gaa 
Tnr Arg Tyr Leu Ser Ser Gin Arg Leu He Lys Glu Ala Cys Asp Glu 



50 



55 60 



tea cac ttt gat aaa aac tta agt caa gcg ctt aaa ttt gta cgt gat 
Ser Arg Phe Lp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg Asp 
65 10 75 

ttt gca gga gac ggg tta get aca age tgg acg cat gaa aaa aat tgg 
Phe Ala Gly Asp Gly Leu Ala Thr Ser Trp Thr Hxs Glu Lys Asn Trp 

85 90 

aaa aaa gcg cat aat ate tta ctt cca age ttc agt cag cag gca atg 
Lys Lys Ala His Asn lie Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 

Y 100 105 110 

aaa ggc tat cat gcg atg atg gtc gat ate gec gtg cag ctt gtt caa 
Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val Gin 

Y 115 120 125 

aaa tea gag cgt eta aat gca gat gag cat att gaa gta ccg gaa gac 
aag rgg gay i-u 1 - ■» = = _ Val . PEO qi u &sp 

Lvs Trp Glu Arg Leu Asn Ala Asp Glu His l±e t.xu 

130 " 135 1« 



at a aca cat tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat 
Me? Thr Arg Leu Thr Leu Lp Thr He Gly Leu Cys Gly Phe Asn Tyr 
145 " 150 155 160 



cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca agt 
Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr Ser 



, t . atc cat aca ctg gat gaa gca atg aac aag ctg cag cga gca aat 
Me? Val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala Asn 

180 185 I 90 

cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa gat 
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Asp 

200 zuo 



195 



atc aag gtg atg aac gac eta gta gat aaa att att gca gat cgc aaa 
He Lys Val Met Asn Asp Leu Val Asp Lys He He Ala Asp Arg Lys 
210 215 220 

aca aac aat gaa caa age gat gat tta tta acg cat atg eta aac gga 
HI Ser S £» Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 

230 235 



225 



aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat 
Lys Isp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg Tyr 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



31 



gta tta. att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca 
Leu lie Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
530 535 540 

aag caa ttt gtc gac tgg tta gac caa gcg tct get gat gaa gta aaa 
Lys Gin Phe Val Asp Trp Leu Asp, Gin Ala Ser Ala Asp Glu Val Lys 



864 



912 



960 



1008 



1104 



245 250 255 

?r ^ C T tta ^ gCg gga CaC gaa aca aca agt ^ ctt 816 

Gin He He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu 

260 265 270 

tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa 
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin 
275 280 285 

aaa gca gca gaa gaa gca gca cga gtt eta gta gat cct gtt cca age 

YS oiS 3 G1U G1U Ala Ala Arg Val Leu Val A sp Pro Val Pro Ser 
290 295 300 

tac aaa caa gtc aaa cag ctt aaa tat gtc ggc atg gtc tta aac gaa 
Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305 * 310 315 320 

gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca aaa 
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 

325 330 " 335 

gaa gat acg gtg ctt gga gga gaa tat cct tta gaa aaa ggc gac gaa 105 6 

• Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Giy Isp GlS 

340 345 350 

r ta f^H S*? ° tg att CCt Cag ctt cac cgt g at aaa aca att tgg gga 
Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp Gly 
355 360 " 365 

gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt gcg 1152 
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 
370 375 380 

att ccg cag cat gcg ttt aaa ccg ttt gga aac ggt cag cgt gcg tot 1200 
He Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala Cys 
385 390 . . 395 400 

ri C S?* ~f g C f g ttC gCt ° pt Cat gaa gca acg ct 9 S ta ctt ggt atg 1248 
He Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 

405 410 415 

atg eta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat 
Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 

420 425 430 

att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca 
He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala 
435 440 445 

aaa teg aaa aaa att ccg ctt ggc ggt att cct tea cct age act gaa 
Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr Glu 
450 455 4 6 o 

ril l Ct ?f fc ^ gta CgC aaa aag gca gaa aac gct cat aa t acg 1440 

Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 

465 470 475 480 

ccg ctg ctt gtg eta tac ggt tea aat atg gga aca gct gaa gga acg 1488 
Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 

485 490 495 

gcg cgt gat tta gca gat att gca atg age aaa gga ttt gca ccg cag 153 6 

Ala Arg Asp Leu Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro Gin 

500 505 sio 

Ctt gat tCa Cac gCC gga aat ctt cc 3 C< 3C gaa gga gct 158 4 

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
515 520 525 



1296 



1344 



1392 



1632 



1680 



32 



545 550 555 560 

ggc gtt cgc tac tec gta ttt gga tgc ggc gat aaa aac tgg get act 
Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr 

5 65 5 / U 

acg tat caa aaa gtg cct get ttt ate gat gaa acg ctt gee get aaa 
Th? Tyr Gin Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala Lys 

580 585 

ggg gca gaa aac ate get gac cgc ggt gaa gca gat gca age gac gac 
Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
Y 595 600 605 

ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta 
Glu Ily Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 



610 



615 620 



gea gee tac ttt aac etc gac att gaa aac agt gaa gat aat aaa tct 
j5« La Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys Ser 



625 



af -t ctt tea ctt caa ttt gtc gac age gee gcg gat atg ccg ctt gcg 
Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 

aaa atg cac ggt gcg ttt tea acg aac gtc gta gca age aaa gaa ctt 
Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys 



660 665 



caa cag cca ggc agt gca cga age acg cga cat ctt gaa att gaa ctt 
Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu He Glu i,e 
675 680 685 

rr** nn-f tct tat caa qaa gga gat cat tta ggt gtt att cct 
cca aaa gaa get tct tat ° dcl v aa yy y x ~ n „ r 7 _-, T i ~ p ro 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He Pro 

~ £Qq /UU 



690 695 



cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc eta 
Arg Asn Tyr Glu Gly He Val Asn Arg Val Thr Ala Arg Phe Gly Leu 



705 



710 715 



rtat aca tea cag caa ate cgt ctg gaa gca gaa gaa gaa aaa tta get 
X "la Ser Gil Gin He Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 

*Y 2 5 

cat tta cca etc get aaa aca gta tec gta gaa gag ctt ctg caa tac 
His Leu Pro Leu La Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 

740 745 

52 SS £ SS S S S 3 S S S S 5 S S K 

755 760 765 

r^t- aaa aca ate tgc ccg ccg cat aaa gta gag ctt gaa gee ttg ctt 
III Lys T 7r Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 

_ _ „ -77c /o(J 



770 775 



gaa aag caa gee tac aaa gaa caa gtg ctg gea -aaa cgt tta aca atg 
Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ma i»ys y 

790 /yii 



785 



rtt aaa ctCT c tt gaa aaa tac ccg gcg tgt gaa atg aaa ttc age gaa 
Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 



805 810 



ttt ate gec ctt ctg cca age ata cgc ccg cgc tat tac teg att tct 
ttt ate goo v, -j.-j.e Ser 

Phe He Ala Leu Leu Pro Ser He Arg Pro Arg J.yr iy 



820 



,1728 



tea tea cct cgt gtc gat gaa aaa caa gca age ate acg gtc age gtt 

Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr vax » 

835 840 

SS S E E S S 13 S 53 E 35 £ "3 £ S 



1776 



1824 



1872 



1920 



1968 



2016 



2064 



2112 



2160 



2208 



2256 



2304 



2352 



2400 



2448 



2496 



2544 



2592 



33 



850 



855 



860 



teg aac tat ctt gcc gag ctg caa gaa gga gat acg att acg tgc ttt 
Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys Phe 
865 870 8 75 880 

att tec aca ccg cag tea gaa ttt acg ctg cca aaa gac cct gaa acg 
He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 

885 890 895 

ccg ctt ate atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc 
Pro Leu He Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Glv 

900 905 910 

ttt gtg cag gcg cgc aaa cag eta aaa gaa caa gga cag tea ctt gga 
Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly 
915 920 925 

gaa gca cat tta tac ttc ggc tgc cgt tea cct cat gaa gac tat ctg 
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 940 

tat caa gaa gag ctt gaa aac gcc caa age gaa ggc ate att acg ctt 
Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr Leu 
945 950 . 955 g6Q 

cat ace get ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag 
His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 

965 970 ' 975 

cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa 
His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp Gin 

980 9 8 5 990 

gga gcg cac ttc tat att tgc gga gac gga age caa atg gca cct gcc 
Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
995 iooo 1005 

gtt gaa gca acg ctt atg aaa age tat get gac gtt cac caa gtg 
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

agt gaa gca gac get cgc tta tgg ctg cag cag- eta gaa gaa aaa 
Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys 
1025 1030 1035 

ggc cga tac gca aaa gac gtg tgg get ggg taa 
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 • 1045 



2640 



2688 



2736 



2784 



2832 



2880 



2928 



2976 



3024 



3069- 



3114 



3147 



<210> 6 

<211> 1048 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Cytochrome P45 0BM-3 mutant 

<400> 6 



Thr He Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn 
15 io 15 

Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys He 

20 25 30 

Ala Asp Glu Leu Gly Glu He Phe Lys Phe Glu Ala Pro Gly Arg Val 
35 40 45 

Thr Arg Tyr Leu Ser Ser Gin Arg Leu lie Lys Glu Ala Cys Asp Glu 
50 55 " 60 

Ser Arg Phe Asp Lys Asn Leu Ser Gin Ala Leu Lys Phe Val Arg Asp 
65 70 75 80 



34 



Phe Ala Gly Asp Gly Leu Ala Thr Ser Trp Thr His Glu Lys Asn Trp 



85 90 



Lys Lys Ala His Asn He Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 



100 



105 



Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val Gin 
115 120 ' 125 

Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu Asp 
130 135 I 40 

Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn Tyr 
145 I 50 155 

Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr Ser 

1. 6 5 

Met val Arg Ala Leu Asp Glu Ala Met Asn Lys Leu Gin Arg Ala Asn 

180 185 190 

Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Asp 
195 200 205 

lie Lys Val Met Asn Asp Leu Val Asp Lys lie He Ala Asp Arg Lys 
210 215 220 

Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
225 230 235 

Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg Tyr 
y 245 250 z bo 

Gin lie lie Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu 

260 265 270 

Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin 
275 ' 280 . 285 

Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 



290 



295 



Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305 ■ 31° 315 

Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 

325 330 

Glu Asp Thr val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 

340 345 350 

Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp Gly 
355 360 365 

Asp Asp val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 



370 375 380 

lie Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala Cys 
385 390 395 

lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 

405 41° 4i 

Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 

420 425 

He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Va.l Val Lys Ala 



435 



440 



445 



Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr Glu 

450 • 455 
Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 
465 470 475 

Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 



35 



485 490 



495 



Ala Arg Asp Leu Ala Asp He Ala Met Ser Lys Gly Phe- Ala Pro Gin 

500 505 5io 

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
515 520 525 

Val Leu He Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
530 535 540 

Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val Lvs 
545 550 555 560 

Gly Val Arg Tyr Ser Val Phe Gly.Cys Gly Asp Lys Asn Trp Ala Thr 

565 570 575 

Thr Tyr Gin Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala Lys 

580 585 590 

Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
595 600 605 

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
610 * 615 620 

Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys Ser 
625 630 635 640 

Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 

645 650 . 655 

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 

660 665 670 

Gin Gin Pro Gly Ser Ala Arg Ser Thr Arg His Leu Glu He Glu Leu 
675 680 685 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He Pro 
690 695 700 

Arg Asn Tyr Glu Gly He Val Asn Arg Val Thr Ala Arg Phe Gly Leu 
705 ' 710 715 720 

Asp Ala Ser Gin Gin He Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 

725 730 ' 735 

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 

740 745 75Q 

Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala 
755 760 765 

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
■ 7 70 775 7 8 o 

Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met 
785 790 795 "* ' 800 

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 

805 810 815 

Phe He Ala Leu Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser He Ser 

820 825 830 

Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr Val Ser Val 
835 840 845 

Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He Ala 
Q 50 855 • 860 

Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys Phe 
865 870 875 ~ 880 

He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 

,885 890 895 



36 



Pro Leu lie Met Val Gly Pro Gly Thr Gly val Ma Pro Phe Arg Gly 

900 905 910 

Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly 

920 925 



915 



Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 940 

Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr Leu 
945 950 955 ybU 

His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 

965 970 9/i > 

His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp Gin 

980 985 990 

Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
y 995 1000 1005 

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 

Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys 
. 1025 1030 1035 

Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 1045 



<210> 7 

<211> 3147 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Cytochrome P45QBM-3 mutant 



<220> 

<221> CDS 

<222> (1) . . (3147) 

aca°att 7 aaa gaa atg cct cag cca aaa acg ttt gga gag ctt aaa aat 
Thr He Lys Glu Me? Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn 
1 ^ 

cca tta tta aac aca gat aaa ccg gtt caa get ttg atg aaa att 
Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys He 

20 25 3U 

rr^t- aaa tta aaa gaa ate ttt aaa ttc gag gcg cct ggt cgt gta 
111 £ Glu leu lly He Phe Lys Phe Glu' Ala Pro Gly Arg Val 

40 45 



35 



acg cgc tac tta tea agt cag cgt eta att aaa gaa gca tgc gat gaa 
Thr Arg Tyr Leu Ser Ser Gin Arg Leu He Lys Glu Ala Cys Asp G±u 

55 60 



50 



tea cgc ttt gat aaa aac tta agt caa ggg ctt aaa ttt gta cgt gat 
Ser Arg Phe Asp Lys Asn Leu Ser Gin Gly Leu Lys Phe Val Arg Asp 
65 70 75 

ttt gca gga gae ggg tta gtt aca age tgg acg cat gaa aaa aat tgg 
Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn Trp 

85 90 

*aa aaa aca cat aat ate tta ctt cca age ttc agt cag cag gca atg 
Lys Lys III His Asn He Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 
Y 100 105 110 

aaa ggc tat cat gcg atg atg gtc gat ate gee gtg cag ctt gtt caa 



48 



96 



144 



192 



240 



288 



336 



384 



37 



432 



480 



528 



Lys Gly Tyr His Ala Met Met Val Asp He Ala Val Gin Leu Val Gin 
115 120 125 

aag tgg gag cgt eta aat gca gat gag cat att gaa gta ccg gaa gac 
Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu Asp 
130 135 ~ 140 * 

atg aca cgt tta acg ctt gat aca att ggt ctt tgc ggc ttt aac tat 
Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn Tyr 
145 150 155 leo 

cgc ttt aac age ttt tac cga gat cag cct cat cca ttt att aca agt 
Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr Ser 

165 170 175 

2*? ° gt g ° a Ctg gat gaa gca at( 3 aac aag cag cag cga gca aat 

Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Gin Gin Arg Ala Asn 

180 185 190 

cca gac gac cca get tat gat gaa aac aag cgc cag ttt caa gaa gat . 624 
Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Isp 
■ 195 . 200 205 

ate aag gtg atg aac gac eta gta gat aaa att att gca gat cgc aaa 
lie Lys val Met Asn Asp Leu Val Asp Lys lie lie Ala isp Arg Lys 

215 220 



576 



672 



816 



864 



gca age ggt gaa caa age gat gat tta tta acg cat atg eta aac gga 720 
Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 
^ 230 235 ■ 240 

aaa gat cca gaa acg ggt gag ccg ctt gat gac gag aac att cgc tat 768 
Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn He Arg Tyr 

245 250 255 

r*n tt*o ^ C tta att gCg gga cac gaa aca aca agt ggt ctt 

Gin He He Thr Phe Leu lie Ala Gly His Glu Thr Thr Ser Gly Leu 

260 265 2 70 

tta tea ttt gcg ctg tat ttc tta gtg aaa aat cca cat gta tta caa 
Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin ' 
275 280 285 

aaa gca gca gaa gaa gca gca cga gtt eta gta gat cct gtt cca age 912 
Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
290 295 300 

£vr Lvs Gin % 9 aaa tat gtC ggC atg gtC tta aac ^a 960 

Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn Glu 

305 310 315 320 

gcg ctg cgc tta tgg cca act get cct gcg ttt tec eta tat gca aaa 1008 
Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 

325 330 335 

ni* a ? g gt ? Ctt gga gga 9 aa tat cct tta gaa aaa ggc gac gaa 1056 

Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 

340 345 " 3 5 o 

r Cta fr^ ? tg att CGt ° ag Ctt c ac"cgt gat aaa aca att tgg gga 1104 

Leu Met Val Leu lie Pro Gin Leu His Arg Asp Lys Thr lie Trp Gly 
355 36O 365 

gac gat gtg gaa gag ttc cgt cca gag cgt ttt gaa aat cca agt qcer 1152 
Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro til Ila 
J/U 375 380 

He Pro Sf! ??* ttt TO& aa ° ggt Cag Cgt *gt ' "00 

lie Pro Gin Ho.s Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala Cys 

385 390 395 ~ 400 

lit ell m« nf g ??* Ctt Cat 933 gCa acg Ctg 9ta ctt ggt atg 1248 

lie Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met * 

405 410 415 

atg eta aaa cac ttt gac ttt gaa gat cat aca aac tac gag ctg gat 1296 



38 



Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 

420 , 425 430 

att aaa gaa act tta acg tta aaa cct gaa ggc ttt gtg gta aaa gca 1344 

He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala 
435 440 445 

aaa teg aaa aaa att ccg ctt ggc ggt att cct tea cct age act gaa 1392 

Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr Glu 
450 455 460 

cag tct get aaa aaa gta cgc aaa aag gca gaa aac get cat aat acg 1440 

Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 

465 470 " "* 475 480 

ccg ctg ctt gtg eta tac ggt tea aat atg gga aca get gaa gga acg 1488 

Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 

485 490 495 

gcg cgt gat tta gca gat att gca atg age aaa gga ttt gca ccg cag 1536 

Ala Arg Asp Leu Ala Asp lie Ala Met Ser Lys Gly Phe Ala' Pro Gin 

500 505 510 

gtc gca acg ctt gat tea cac gec gga aat ctt ccg cgc gaa gga get 1584 

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
515 520 525 

gta tta att gta acg gcg tct tat aac ggt cat ccg cct gat aac gca 1632 

Val Leu He Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
530 535 540 

aag caa ttt gtc gac tgg tta gae caa gcg tct ' get gat gaa gta aaa 1680 

Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val Lys 

545 550 555 560 

ggc gtt cgc tac tec gta ttt gga tgc ggc gat aaa aac tgg get act 1728 

Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr 

565 ~ ~ 570 " * 575 ' 

acg tat caa aaa gtg cct get ttt ate gat gaa acg ctt gee get aaa 1776 

Thr Tyr Gin. Lys Val Pro Ala Phe He Asp Glu Thr Leu Ala Ala Lys 

580 585 ' 590 

ggg gca gaa aac ate get gac cgc ggt gaa gca gat gca age gac gac 1824 

Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
595 600 605 

ttt gaa ggc aca tat gaa gaa tgg cgt gaa cat atg tgg agt gac gta 1872 

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
610 615 620 

gca gee tac ttt aac etc gac att gaa aac agt gaa gat aat aaa tct 1920 

Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys Ser 

625 630 635 640 

act ctt tea ctt caa ttt gtc gac age gee gcg gat atg ccg ctt gcg 1968 

Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 

645 650 655 

aaa atg cac ggt gcg ttt tea acg aac gtc gta gca age aaa gaa ctt 2016 

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 

660 665 670 

caa cag cca ggc agt gca cga age acg cga cat ctt gaa att gaa ctt 2 0 64 

Gin Gin Pro Gly Ser Ala Arg- Ser Thr Arg His Leu Glu He Glu Leu 
675 680 685 

cca aaa gaa get tct tat caa gaa gga gat cat tta ggt gtt att cct 2112 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He Pro 
690 695 700 

cgc aac tat gaa gga ata gta aac cgt gta aca gca agg ttc ggc eta 2160 

Arg Asn Tyr Glu Gly He Val Asn Arg Val Thr Ala Arg Phe Gly Leu 

705 710 715 720 

gat gca tea cag caa ate cgt ctg gaa gca gaa gaa gaa aaa tta get 2208 
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Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 

725 730 735 

cat ttg cca etc get aaa aca gta tec gta gaa gag ctt ctg caa tac 2256 
His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 

740 745 750 

gtg gag ctt caa gat cct gtt acg cgc acg cag ctt cgc gca atg get 2304 
Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala 
755 760 765 

get aaa acg gtc tgc ccg ccg cat aaa gta gag ctt gaa gee ttg ctt 2352 
Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
770 775 780 

gaa aag caa gee tac aaa gaa caa gtg ctg gca aaa cgt tta aca atg 2 400 

Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met 
785 790 795 800 

» 

ctt gaa ctg ctt gaa aaa tac ccg gcg tgt gaa atg aaa ttc age gaa 2448 

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 

805 810 815 

ttt ate gee ctt ctg cca age ata cgc ccg cgc tat tac teg att tct 2496 
Phe He Ala Leu Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser lie Ser 

820 825 " ** 830 

tea tea cct cgt gtc gat gaa aaa caa gca age ate acg gtc age gtt 2544 
Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr Val Ser Val 
835 840 845 

gtc tea gga gaa gcg tgg age gga tat gga gaa tat aaa gga att gcg 2592 
Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He Ala 
850 855 860 

* 

teg aac tat ctt gee gag ctg caa gaa gga gat acg att acg tgc ttt 2 64 0 

Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys Phe 
865 870 875 880 

att tec aca ccg cag tea gaa ttt acg ctg cca aaa gac cct gaa acg 2 68 8 

He Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 

885 890 895 

ccg ctt ate atg gtc gga ccg gga aca ggc gtc gcg ccg ttt aga ggc 273 6 

Pro Leu He Met Val Gly- Pro Gly Thr Gly Val Ala Pro Phe Arg Gly 

900 905 ' 910 

ttt gtg cag gcg cgc aaa cag eta aaa gaa caa gga cag tea ctt gga 2784 
Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly 
915 920 925 

gaa gca cat tta tac ttc ggc tgc cgt tea cct cat gaa gac tat ctg 2832 
Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 ■ 940 

tat caa gaa gag ctt gaa aac gee caa age gaa ggc ate att acg ctt 2880 
Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr Leu 
945 950 955 J 960 

cat ace get ttt tct cgc atg cca aat cag ccg aaa aca tac gtt cag 2928 
His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 

965 970 ~ 975 

cac gta atg gaa caa gac ggc aag aaa ttg att gaa ctt ctt gat caa 2976 
His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp Gin 

980 985 990 

gga gcg cac ttc tat att tgc gga gac gga age caa atg gca cct gee 3024 
Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
995 iooo 1005 



gtt gaa gca acg ctt atg aaa age tat get gac gtt cac caa gtg 
Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1020 



3069 



agt gaa gca gac get cgc tta tgg ctg cag cag eta gaa gaa aaa 3114 



40 



Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys 
1025 1030 1035 

3147 

ggc cga tac gca aaa gac gtg tgg get ggg taa 
Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040 " 1045 



<210> 8 

<211> 1048 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> Cytochrome P450BM-3 mutant 



<400> 8 

Thr lie Lys Glu Met Pro Gin Pro Lys Thr Phe Gly Glu Leu Lys Asn 
1 ~ 5 10 ■ 15 

Leu Pro Leu Leu Asn Thr Asp Lys Pro Val Gin Ala Leu Met Lys He 

20 25 30 

Ala Asp Glu Leu Gly Glu He Phe Lys Phe Glu Ala Pro Gly Arg Val 
35 40 45 

Thr Arg Tyr Leu Ser Ser Gin Arg Leu He Lys Glu Ala Cys Asp Glu 
50 55 60 

Ser Arg Phe Asp Lys Asn Leu Ser Gin Gly Leu Lys Phe Val Arg Asp 
65 70 75 

Phe Ala Gly Asp Gly Leu Val Thr Ser Trp Thr His Glu Lys Asn Trp 

85 9° 95 

Lys Lys Ala His Asn He Leu Leu Pro Ser Phe Ser Gin Gin Ala Met 

100 ■ 105 11° 

Lys Gly Tyr His Ala Met Met. Val Asp He Ala Val Gin Leu Val . Gin 
115 120 i25 

Lys Trp Glu Arg Leu Asn Ala Asp Glu His He Glu Val Pro Glu Asp 
130 135 140 

Met Thr Arg Leu Thr Leu Asp Thr He Gly Leu Cys Gly Phe Asn Tyr 
145 ~ 150 155 ±t,u 

Arg Phe Asn Ser Phe Tyr Arg Asp Gin Pro His Pro Phe He Thr Ser 

165 I 70 

Met Val Arg Ala Leu Asp Glu Ala Met Asn Lys Gin Gin Arg Ala Asn 

180 185 19° 

Pro Asp Asp Pro Ala Tyr Asp Glu Asn Lys Arg Gin Phe Gin Glu Asp 
195 200 205 

He Lys Val Met Asn Asp Leu Val Asp Lys He He Ala Asp Arg Lys 
210 215 220 

Ala Ser Gly Glu Gin Ser Asp Asp Leu Leu Thr His Met Leu Asn Gly 

225 230 235 

Lys Asp Pro Glu Thr Gly Glu Pro Leu Asp Asp Glu Asn' He Arg Tyr 

245 250 255 

Gin He He Thr Phe Leu He Ala Gly His Glu Thr Thr Ser Gly Leu 

260 265 270 

Leu Ser Phe Ala Leu Tyr Phe Leu Val Lys Asn Pro His Val Leu Gin 
275 280 285 

Lys Ala Ala Glu Glu Ala Ala Arg Val Leu Val Asp Pro Val Pro Ser 
290 295 300 
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Tyr Lys Gin Val Lys Gin Leu Lys Tyr Val Gly Met Val Leu Asn Glu 
305 310 315 320 

Ala Leu Arg Leu Trp Pro Thr Ala Pro Ala Phe Ser Leu Tyr Ala Lys 

325 330 " 335 

Glu Asp Thr Val Leu Gly Gly Glu Tyr Pro Leu Glu Lys Gly Asp Glu 

340 345 350 

Leu Met Val Leu He Pro Gin Leu His Arg Asp Lys Thr He Trp Gly 
355 360 ' 3 65 

Asp Asp Val Glu Glu Phe Arg Pro Glu Arg Phe Glu Asn Pro Ser Ala 
370 375 380 

He Pro Gin His Ala Phe Lys Pro Phe Gly Asn Gly Gin Arg Ala Cvs 
385 390 395 400 

He Gly Gin Gin Phe Ala Leu His Glu Ala Thr Leu Val Leu Gly Met 

405 410 • 415 

Met Leu Lys His Phe Asp Phe Glu Asp His Thr Asn Tyr Glu Leu Asp 

420 . 425 430 

He Lys Glu Thr Leu Thr Leu Lys Pro Glu Gly Phe Val Val Lys Ala 
435 440 445 

Lys Ser Lys Lys He Pro Leu Gly Gly He Pro Ser Pro Ser Thr Glu 
450 455 460 

Gin Ser Ala Lys Lys Val Arg Lys Lys Ala Glu Asn Ala His Asn Thr 
465 4 70 475 480 

Pro Leu Leu Val Leu Tyr Gly Ser Asn Met Gly Thr Ala Glu Gly Thr 

485 490 495 

Ala Arg Asp Leu Ala Asp He Ala Met Ser Lys Gly Phe Ala Pro Gin 

500 505 510 

Val Ala Thr Leu Asp Ser His Ala Gly Asn Leu Pro Arg Glu Gly Ala 
515 520 525 

Val Leu He Val Thr Ala Ser Tyr Asn Gly His Pro Pro Asp Asn Ala 
530 535 540 

Lys Gin Phe Val Asp Trp Leu Asp Gin Ala Ser Ala Asp Glu Val Lys 
545 550 555 560 

Gly Val Arg Tyr Ser Val Phe Gly Cys Gly Asp Lys Asn Trp Ala Thr 

565 570 575 

Thr Tyr Gin Lys Val" Pro Ala Phe He Asp Glu Thr Leu Ala Ala Lys 

580 585 590 

Gly Ala Glu Asn He Ala Asp Arg Gly Glu Ala Asp Ala Ser Asp Asp 
595 600 605 

Phe Glu Gly Thr Tyr Glu Glu Trp Arg Glu His Met Trp Ser Asp Val 
610 615 620 

Ala Ala Tyr Phe Asn Leu Asp He Glu Asn Ser Glu Asp Asn Lys Ser 
625 630 635 64 0 

Thr Leu Ser Leu Gin Phe Val Asp Ser Ala Ala Asp Met Pro Leu Ala 

645 650 655 

Lys Met His Gly Ala Phe Ser Thr Asn Val Val Ala Ser Lys Glu Leu 

660 665 670 

Gin Gin Pro Gly Ser Ala Arg -Ser Thr Arg His Leu Glu He Glu Leu 
675 680 685 

Pro Lys Glu Ala Ser Tyr Gin Glu Gly Asp His Leu Gly Val He Pro 
' 690 695 700 

- 

Arg Asn Tyr Glu Gly He Val Asn Arg Val. Thr Ala Arg Phe Gly Leu 
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705 



710 715 720 



Asp Ala Ser Gin Gin lie Arg Leu Glu Ala Glu Glu Glu Lys Leu Ala 

725 730 735 

His Leu Pro Leu Ala Lys Thr Val Ser Val Glu Glu Leu Leu Gin Tyr 

740 745 ' 

Val Glu Leu Gin Asp Pro Val Thr Arg Thr Gin Leu Arg Ala Met Ala 
755 7 <50 765 

Ala Lys Thr Val Cys Pro Pro His Lys Val Glu Leu Glu Ala Leu Leu 
770 775 780 

Glu Lys Gin Ala Tyr Lys Glu Gin Val Leu Ala Lys Arg Leu Thr Met 
785 790 795 

Leu Glu Leu Leu Glu Lys Tyr Pro Ala Cys Glu Met Lys Phe Ser Glu 

805 810 81 = 

Phe lie Ala Leu Leu Pro Ser He Arg Pro Arg Tyr Tyr Ser He Ser 

820 825 830 

Ser Ser Pro Arg Val Asp Glu Lys Gin Ala Ser He Thr Val Ser Val 
835 840 845 

Val Ser Gly Glu Ala Trp Ser Gly Tyr Gly Glu Tyr Lys Gly He Ala 
850 855 860 

Ser Asn Tyr Leu Ala Glu Leu Gin Glu Gly Asp Thr He Thr Cys Phe 
865 870 . 875 880 

lie Ser Thr Pro Gin Ser Glu Phe Thr Leu Pro Lys Asp Pro Glu Thr 

885 890 895 

Pro Leu He Met Val Gly Pro Gly Thr Gly Val Ala Pro Phe Arg Gly 

900 905 91° 

Phe Val Gin Ala Arg Lys Gin Leu Lys Glu Gin Gly Gin Ser Leu Gly 
915 920 925 

Glu Ala His Leu Tyr Phe Gly Cys Arg Ser Pro His Glu Asp Tyr Leu 
930 935 940 

Tyr Gin Glu Glu Leu Glu Asn Ala Gin Ser Glu Gly He He Thr Leu 
945 950 955 

His Thr Ala Phe Ser Arg Met Pro Asn Gin Pro Lys Thr Tyr Val Gin 

965 970 9/i > 

His Val Met Glu Gin Asp Gly Lys Lys Leu He Glu Leu Leu Asp Gin 

980 985 990 

Gly Ala His Phe Tyr He Cys Gly Asp Gly Ser Gin Met Ala Pro Ala 
995 1000 1005 

Val Glu Ala Thr Leu Met Lys Ser Tyr Ala Asp Val His Gin Val 
1010 1015 1° 20 

Ser Glu Ala Asp Ala Arg Leu Trp Leu Gin Gin Leu Glu Glu Lys 
1025 1030 1035 

Gly Arg Tyr Ala Lys Asp Val Trp Ala Gly 
1040' 1045 



<210> 9 

<211> 1032 

<212> DNA 

<213> Nocardia corallina 



<220> 

<221> CDS 

<222> (1).-(1032) 
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<400> 9 

atg acg aca gag gcg acg gtg gcc cga ccg gtg gag etc gaa ggt cac 48 
Met Thr Thr Glu Ala Thr Val Ala Arg Pro Val Glu Leu Glu Gly His 
15 10 15 

egg aca ttc acc tgg ttc acg ccc gcc agg cga aag ccg acg gag tac 96 
Arg Thr Phe Thr Trp Phe Thr Pro Ala Arg Arg Lys Pro Thr Glu Tyr 

20 25 30 

gag etc tac -acc gtg ggt caa cag tec act ccg gac gag tgg ctg cat 144 
Glu Leu Tyr Thr Val Gly Gin Gin Ser Thr Pro Asp Glu Trp Leu His 
35 40 45 

gtg gac tgg ccg ctg cgc ttc gac gac ggc cgc gcc ccg tgg gag gag 192 
Val Asp Trp Pro Leu Arg Phe Asp Asp Gly Arg Ala Pro Trp Glu Glu 
50 55 60 

gag teg agt gcg gta egg acc teg gag tgg teg get tac cgc gac cca 240 
Glu Ser Ser Ala Val Arg Thr Ser Glu Trp Ser Ala Tyr Arg Asp Pro 
65 70 75 ^ 80 

cac caa ctg tgg cag cgt ccc tac gtc age acg tgc aac cag gac cag 288 
His Gin Leu Trp Gin Arg Pro Tyr Val Ser Thr Cys Asn Gin Asp Gin 

85 90 95 

cag gcc etc gcg egg ctg gtc ccc gtc ctg acc atg ggg teg gcg gcg 33 6 

Gin Ala Leu Ala Arg Leu Val Pro Val Leu Th'r Met Gly Ser Ala Ala 

100 105 HO 

ate acg ccc ate tgg teg cag aag ate etc gcc agg tec tac gcc gcc 384 
He Thr Pro He Trp Ser Gin Lys He Leu Ala Arg Ser Tyr Ala Ala 
115 120 " 125 

tgg cca ttc gtc gag tac ggg etc ttc ctg age ctg gcc tac gcc gtg 432 
Trp Pro Phe Val Glu Tyr Gly Leu Phe Leu Ser Leu Ala Tyr Ala Val 
130 135 140 

cgc cag gcc atg tec gac acg gtc cag ttc age gtg gtg ttc cag gcc 4 80 

Arg Gin Ala Met Ser Asp Thr Val Gin Phe Ser Val Val Phe Gin Ala 
145 150 155 160 

gtg gac cgc atg egg ctg etc cag gac ate gtc cac cac ctg gac cac 528 
Val Asp Arg Met Arg Leu Leu Gin Asp He Val His His Leu Asp His 

165 170 175 

ctg cag gag teg ccg gaa ttc age gac gcc ggg gcc cgc gag gcc tgg 57 6 

Leu Gin Glu Ser Pro Glu Phe Ser Asp Ala Gly Ala Arg Glu Ala Trp 

180 185 190 



atg tec gac tec acc ctg gtc ccg ate egg gaa gtg ate gag cgc ate 
Met Ser Asp Ser Thr Leu Val Pro He Arg Glu Val He Glu Arg He 
195 200 205 



ttc gag cct ctg gtc ggc cac ctg gcg aag gcc gag ttg ttc age cgc 
Phe Glu Pro Leu Val Gly His Leu Ala Lys Ala Glu Leu Phe Ser Arg 
225 230 235 240 



624 



gcc gcc age cag gac tgg gtg gag ate ctg gtc gcc ggc acg etc gtc 672 
Ala Ala Ser Gin Asp Trp Val Glu He Leu Val Ala Gly Thr Leu Val 
210 215 220 



720 



cgt gcg cca atg ttc ggg gac ggg acc acg ccg gcg gtg ctg gcg teg 768 
Arg Ala Pro Met Phe Gly Asp Gly Thr Thr Pro Ala Val Leu Ala Ser 

245 250 255 

gcc ctg ctg gac age ggc agg cac etc gaa teg gtc cag gcg etc gtc 816 
Ala Leu Leu Asp Ser Gly Arg His Leu Glu Ser Val Gin Ala Leu Val ■ 

260 265 270 

cgc etc gtc tgc caa gac ccc gtc cat ggc gac cag aac cag gcg act 864 
Arg Leu Val Cys Gin Asp Pro Val His Gly Asp Gin Asn Gin Ala Thr 
275 280 285 

gtg egg egg tgg ate gag gaa tgg cag ccg egg tgc aag gcg gcg gcc 912 
Val Arg Arg Trp He Glu Glu Trp Gin Pro Arg Cys Lys Ala Ala Ala 
290 295 300 
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caq tec ttc ctg ccg acg ttc tec gac tgc ggc ate gac gee aag gaa 
Gin Ser Phe Leu Pro Thr Phe Ser Asp Cys Gly He Asp Ala Lys Glu 
305 310 315 320 

age gee aac gcg ctg tec egg gcg ctg gcg aac cag egg gee gee gtc 
Ser Ala Asn Ala Leu Ser Arg Ala Leu Ala Asn Gin Arg Ala Ala Val 

325 330 335 

gag ggc gec ggc ate acg gca tga 
Glu Gly Ala Gly He Thr Ala 

340 



<210> 10 

<211> 343 

<212> PRT 

<213> Nocardia corallina 

<400> 10 

Met Thr Thr Glu Ala Thr Val Ala Arg Pro Val Glu Leu Glu Gly His 
1 5 10 15 

Ara Thr Phe Thr Trp Phe Thr Pro Ala Arg Arg Lys Pro Thr Glu Tyr 

20 25 30 

Glu Leu Tyr Thr Val Gly Gin Gin Ser Thr Pro Asp Glu Trp Leu His 
35 40 45 

Val Asp Trp Pro Leu Arg Phe Asp Asp Gly Arg Ala Pro Trp Glu Glu 
50 55 60 

Glu Ser Ser Ala Val Arg Thr Ser Glu Trp Ser Ala Tyr Arg Asp Pro 
65 70 75 80 

His Gin Leu Trp Gin Arg Pro Tyr Val Ser Thr Cys Asn Gin Asp Gin 

85 90 95 

Gin Ala Leu Ala Arg Leu. Val Pro Val Leu Thr Met Gly Ser Ala Ala 

100 105 11° 

He Thr Pro He Trp Ser Gin Lys He Leu Ala Arg Ser Tyr Ala Ala 
115 120 : 125 

Trr> Pro Phe Val Glu Tyr Gly Leu Phe Leu Ser Leu Ala Tyr Ala Val 
130 135 140 

Arg Gin Ala Met Ser Asp Thr Val Gin Phe Ser Val Val Phe Gin Ala 
14 5 150 155 

Val Asp Arg Met Arg Leu Leu Gin Asp He Val His His Leu Asp His 

165 170 175 

Leu Gin Glu Ser Pro Glu Phe Ser Asp Ala Gly Ala Arg Glu Ala Trp 

180 185 I 90 

Met Ser Asp Ser Thr Leu Val Pro He Arg Glu Val He Glu Arg He 
195 200 205 

Ala Ala Ser Gin Asp Trp Val Glu He Leu Val Ala Gly Thr Leu Val 
210 ■ 215 220 

Phe Glu Pro Leu Val Gly His Leu Ala Lys Ala Glu Leu Phe Ser Arg 
225 230 235 ^ u 

Arg Ala Pro Met Phe Gly Asp Gly Thr Thr Pro Ala Val Leu Ala Ser 
y 245 250 255 

Ala Leu Leu Asp Ser Gly Arg His Leu Glu Ser Val Gin Ala Leu Val 

260 265 2 70 

Arq Leu Val Cys Gin Asp Pro Val His Gly Asp Gin Asn Gin Ala Thr 
275 ~ 280 285 

Val Arg Arg Trp He Glu Glu Trp Gin Pro Arg Cys Lys Ala Ala Ala 



960 



1008 



1032 



45 



290 295 



300 



Gin Ser Phe Leu Pro Thr Phe Ser Asp Cys Gly He Asp Ala Lys Glu 
305 310 315 320 

Ser Ala Asn Ala Leu Ser Arg Ala Leu Ala Asn Gin Arg Ala Ala Val 

325 330 335 

Glu Gly Ala Gly He Thr Ala 

340 



<210> 11 

<211> 1506 

<212> DNA 

<213> Nocardia corallina 



<220> 

<221> CDS 

<222> (1).. (1506) 

<400> 11 

atg gca teg aac ccc acc cag etc cac gag aag teg aag tec tac gac 

Met Ala Ser Asn Pro Thr Gin Leu His Glu Lys Ser Lys Ser Tyr Asp 
1 5 .10 15 

tgg gac ttc acc tec gtc gag egg cgc ccc aag ttc gag acg aag tac 
Trp Asp Phe Thr Ser Val Glu Arg Arg Pro Lys Phe Glu Thr Lys Tyr 

20 25 30 

aag atg ccc aag aag ggc aag gac ccg ttc cgc gtc ctg ate cgt gac 
Lys Met Pro . Lys Lys Gly Lys Asp Pro Phe Arg Val Leu He Arg Asp 
35 40 45 

tac atg aag atg gaa gcg gag aag gac gac egg acc cat ggc ttc etc 
Tyr Met Lys Met Glu Ala Glu Lys Asp Asp Arg Thr His Gly Phe Leu 
50 55 60 

gac ggc gee gtg egg acg cgt gag gee acc agg att gag ccg egg ttc 
Asp Gly Ala Val Arg Thr Arg Glu Ala Thr Arg He Glu Pro Arg Phe 
65 70 75 80 

get gag gee atg aag ate atg gtg ccg cag ctg acc aac .gee gag tac 
Ala Glu Ala Met Lys He Met Val Pro Gin Leu Thr Asn Ala Glu Tyr 

85 90 95 



48 



96 



144 



192 



240 



288 



cag gcg gtg gcg ggc tgc gga atg ate ate teg gee gtc gag aac cag 33 6 

Gin Ala Val Ala Gly Cys Gly Met He He Ser Ala Val Glu Asn Gin - 

100 105 no 

gag etc cgt cag ggc tac gee get cag atg etc gat gag gtg egg cac ' 38 4 

Glu Leu Arg Gin Gly Tyr Ala Ala Gin Met Leu Asp Glu Val Arg His 
115 120 125 

gcg cag etc gag atg acg eta cgc aac tac tac gcg aag cac tgg tgc 432 
Ala Gin Leu Glu Met Thr Leu Arg Asn Tyr Tyr Ala Lys His Trp Cys 
130 . , 135 140 

gat ccc tec ggc ttc gac ate ggt cag cgc ggc ctg tac cag cac ccc 480 
Asp Pro Ser Gly Phe Asp He Gly Gin Arg Gly Leu Tyr Gin His Pro 
145 150 155 160 

gcg ggg ctg gtg tec ate ggc gag ttc cag cac ttc aat act ggt gac 528 
Ala Gly Leu Val Ser He Gly Glu Phe Gin His Phe Asn Thr Gly Asp 

165 170 175 

ccg ctt gac gtc ate ate gat etc aac ate gtg gee gag acg gcg ttc 
Pro Leu Asp Val He He Asp Leu Asn He Val Ala Glu Thr Ala Phe 

180 185 190 

acg aac ate ctg ctg gtg gee act cca cag gtc gee gtg gee aac ggg 624 
Thr Asn lie Leu Leu Val Ala Thr Pro Gin Val Ala Val Ala Asn Gly 
195 200 205 

gac aac gcg atg gee age gtg ttc etc teg ate cag teg gac gag gee 672 



576 



46 



Asp Asn Ala Met Ala Ser Val Phe Leu Ser He Gin Ser Asp Glu Ala 
210 215 220 

agg cac atg gcc aac ggg tac ggc teg gtc atg gcg ctg ctg gag aac 
A?g His Met Ala Asn Gly Tyr Gly Ser Val Met Ala Leu Leu Glu Asn 

230 235 



225 



gag gac aac etc eeg ctg etc aac cag tct etc gat egg cac ttc tgg 
ilu Asp Asn Leu Pro Leu Leu Asn Gin Ser Leu Asp Arg His Phe Trp 

245 250 

r-ot acc cac aag gcc ttg gac aac gcg gtc gga tgg tgt teg gag tat 
Arg Ala His LyI Ala Leu Asp Asn Ala Val Gly Trp Cys Ser Glu Tyr 

260 265 270 

ggc gcc cgc aag egg cea tgg age tac aag gee cag tgg gag gaa tgg 
Gly Ala Arg Lys Arg Pro Trp Ser Tyr Lys Ala Gin Trp Glu Glu Trp 
275 280 285 

gtc gtc gac gac ttc gtg ggc ggc tac ate gac cga etc age gag ttc 
Val Val Asp Asp Phe Val Gly Gly Tyr He Asp Arg Leu Ser Glu Phe 
290 295 300 

gac gtt cag get ccg gcc tgc ctt ggc gcg gcc gcc gac gag gtc aag 
III Val Gin La Pro Ala Cys Leu Gly Ala Ala Ala Asp Glu Val Lys 
305 310 315 

tgg teg cac cac acg etc ggt cag gtg ctg teg gcg gtg tgg ccg ctg 
Trp Ser His His Thr Leu Gly Gin Val Leu Ser Ala Val Trp Pro Leu 

325 330 

aac ttc tgg ege teg gac gee atg gga ccg gcg gac ttc gag tgg ttc 
Asn Phe Trp Arg Ser Asp Ala Met Gly Pro Ala Asp Phe Glu Trp Phe 

340 345 3oU 

gag aac cac tac ccg ggc tgg age gcg gee tac cag ggt tac tgg gag 
111 Asn His Tyr Pro Gly Trp Ser Ala Ala Tyr Gin Gly Tyr Trp Giu 
355 360 365 

n<rn t- S r aacr acg etc gcc gac cea gca ggc gga cgc ate atg etc cag 
l!y Tyr Lyl Ala Leu Ala Asp Pro Ala Gly Gly Arg He Met Leu Gin 
3 *7 0 

gag ctg eeg ggt etg ccg ccg atg tgt cag gtg tgc cag gtg ccg tgc 
llu Leu Pro Gly Leu Pro Pro Met Cys Gin Val Cys Gin Val Pro Cys 
385 390 395 

gtg atg ccg egg etg gat atg aac gcc gcg egg ate ate gag ttc gag 
Val Met Pro Arg Leu Asp Met Asn Ala Ala Arg He He Glu Phe Glu 

405 410 



ggg cag aaa ate gcg ctg tgc age gaa cec tgc cag egg ate ttc ace 
lly Gin Lys He Ala Leu Cys Ser Glu Pro Cys Gin Arg lie Phe Thr 



420 . 425 



* flr - taa ccg gag gcg tac cgc cac cgc aag caa tac tgg gcc cgc tac 
Asn Trp Pro Glu Ala Tyr A?g His Arg Lys Gin Tyr Trp Ala Arg Tyr 
435 440 

cac gga tgg gac ctg gcg gac gtc ate gtt gat etc ggc tac ate cgc 
His Gly Trp Asp Leu Ala Asp Val He Val Asp Leu Gly Tyr He Arg 



450 455 



ccg gac ggc aag acc etc ate ggc cag ccg etg etc gag atg gag egg 
Pro Asp Gly LyI Thr Leu He Gly Gin Pro Leu Leu Glu Met Glu Arg 



465 



470 475 



ctg tgg acc ate gac gac ate egg gcc ctt cag tac gaa gtc aag gac 
Leu Trp Thr He Asp Asp He Arg Ala Leu Gin Tyr Glu Val Lys Asp 



485 49° 



ccg ttg cag gag gcg tga 
Pro Leu Gin Glu Ala 

500 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1506 
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<210> 12 

<211> 501 

<212> PRT 

<213> Nocardia corallina 

<400> 12 

Met Ala Ser Asn Pro Thr Gin Leu His Glu Lys Ser Lys Ser Tyr Asp 
1 5 10 15 

Trp Asp Phe Thr Ser Val Glu Arg Arg Pro Lys Phe Glu Thr Lys Tyr 

20 25 30 

Lys Met Pro Lys Lys Gly Lys Asp Pro Phe Arg Val Leu lie Arg Asp 
35 .40 45 

Tyr Met Lys Met Glu Ala Glu Lys Asp Asp Arg Thr His Gly Phe Leu 
50 55 go 

Asp Gly Ala Val Arg Thr Arg Glu Ala Thr Arg lie Glu Pro Arg Phe 
65 70 75 80 

Ala Glu Ala Met Lys He Met Val Pro Gin Leu Thr Asn Ala Glu Tyr 

85 90 95 

Gin Ala Val Ala Gly Cys Gly Met He lie Ser Ala Val Glu Asn Gin 

100 105 no 

Glu Leu Arg Gin Gly Tyr Ala Ala Gin Met Leu Asp Glu Val Arg His 
115 120 125 

Ala Gin Leu Glu Met Thr Leu Arg Asn Tyr Tyr Ala Lys His Trp Cys 
130 135 14Q 

Asp Pro Ser Gly Phe Asp He Gly Gin Arg Gly Leu Tyr Gin His Pro 
145 . 150 155 160 

Ala Gly Leu Val Ser He Gly Glu Phe Gin His Phe Asn Thr Gly Asp 

165 170 175 

Pro Leu Asp Val He He Asp Leu Asn He Val Ala Glu Thr Ala Phe 

180 ' 185 190 

Thr Asn He Leu Leu Val Ala Thr Pro Gin Val Ala Val Ala Asn Gly 
195 . 200 205 

Asp Asn Ala Met Ala Ser Val Phe Leu Ser He Gin Ser Asp Glu Ala 
21° 215 220 

Arg His Met Ala Asn Gly Tyr Gly Ser Val Met Ala Leu Leu Glu Asn 
225 230 235 240 

Glu Asp Asn Leu Pro Leu Leu Asn Gin Ser Leu Asp Arg His Phe Trp 

245 250 255 

Arg Ala His Lys Ala Leu Asp Asn Ala Val Gly Trp Cys Ser Glu Tyr 

2 60 2 65 270 

Gly Ala Arg Lys Arg Pro Trp Ser Tyr Lys Ala Gin Trp Glu Glu Trp 
275 280 285 

Val Val Asp Asp Phe Val Gly Gly Tyr He Asp Arg Leu Ser Glu Phe 
290 295 300 

Gly Val Gin Ala Pro Ala Cys Leu Gly Ala Ala Ala Asp Glu Val Lys 
305 310 315 320 

Trp Ser His His Thr Leu Gly Gin Val Leu Ser Ala Val Trp Pro Leu 

325 330 335 

Asn Phe Trp Arg Ser Asp Ala Met Gly Pro Ala Asp Phe Glu Trp Phe 

340 345 350 

Glu Asn His Tyr Pro Gly Trp Ser Ala Ala Tyr Gin Gly Tyr Trp Glu 
355 360 • 365 



48 



Gly Tyr Lys Ala 
370 

Glu Leu Pro Gly 
385 

Val Met Pro Arg 



Gly Gin Lys lie 

420 



Asn Trp Pro Glu 
435 

His Gly Trp Asp 
450 

Pro Asp Gly Lys 
465 

Leu Trp Thr He 



Pro Leu Gin Glu 

500 



Leu Ala Asp Pro 
375 

Leu Pro Pro Met 
390 

Leu Asp Met Asn 
405 

Ala Leu Cys Ser 



Ala Tyr Arg His 

440 



Leu Ala Asp Val 
' 455 

Thr Leu He Gly 
470 

Asp Asp He Arg 
485 

Ala 



Ala Gly Gly Arg 

380 



Cys Gin Val Cys 
395 

Ala Ala Arg He 
410 

Glu Pro Cys Gin 
425 

Arg Lys Gin Tyr 



He Val Asp Leu 

460 



Gin Pro Leu Leu 
475 

Ala Leu Gin Tyr 
490 



He Met Leu Gin 



Gin Val Pro Cys 

400 



He Glu Phe Glu 
415 

Arg He Phe Thr 
430 

Trp Ala Arg Tyr 
445 

Gly Tyr He Arg 



Glu Met Glu Arg 

480 

Glu Val Lys Asp 
495 



<210> 13 

<211> 1494 

<212> DNA 

<213> Xanthobacta sp 



<220> 
<221> 
<222> 



CDS 

(1) . . (1494) 



cq tc ttg aat egg gac gat tgg tac gac ate gcg cgc gat gtc 
nil III Leu Leu Asn A^g Asp Asp Trp Tyr Asp He Ala Arg Asp Val 
1 5 10 

gac tgg acg etc age tat gtc gac cgc gcg gtc gee ttt cce gag gag 
Asp Trp Thr Leu Ser Tyr Val Asp Arg Ala Val Ala Phe Pro G±u 

20 25 

55 "'. IS K S K £ S S S E S S S S C 

35 40 

aaa ccc ttc egg gtc tec ttc cgc gaa tat gtg atg gtc cag cgc gac 
III Pro Phe Arg Val Ser Phe Arg Glu Tyr Val Met Val Gin Arg Asp 
50 55 60 

S K a K S IS S S 5 K E K !S S E 5 

65 70 75 

S SE S S E K K IS K 5 E S H S 2 S 

85 • 90 

s s s s s s « s s si a s s s ss s 

100 105 

ccc ttc gtg cgc ttc gcg ccg tec gec cgc tgg cgc age etc ggg gcg 
A?g Phe va? A?g Phe Ala Pro Ser Ala Arg Trp Arg Ser Leu Gly Ala 
115 120 

K IS S S3 K IS = S S S - £ S S 5 S 

130 135 I 40 

age eae gat etg etc aac gat tee ccg age ttc gac tgg age cag egg 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



I 



49 



Ser His Asp Leu Leu Asn Asp Ser Pro Ser Phe Asp Trp Ser Gin Arg 
145 150 155 160 

S a ° gaC gaa tgg gcg gtt ctc ? cc acc aac ctg ttc 

Ala Phe His Thr Asp Glu Trp Ala Val Leu Ala Thr Arg Asn Leu Phe 

165 170 175 

gac gac ate atg ctc aac gec gac tgc gtg gag gcg gcg ctc gec acc 57 6 

Asp Asp He Met Leu Asn Ala Asp Cys Val Glu Ala Ala Leu Ala Thr " 

180 185 190 



age ctg acg ctg gag cac ggc ttc acc aac ate cag ttc gtg gcg ctc 
Ser Leu Thr Leu Glu His Gly Phe Thr Asn He Gin Phe Val Ala Leu 
195 200 205 

gec tec gac gee atg gaa gee ggc gac gtg aac ttc tec aac ctc ttg 
Ala Ser Asp Ala Met Glu Ala Gly Asp Val Asn Phe Ser Asn Leu Leu 
210 215 220 



teg age ate cag acc gac gag gcg egg cac gee cag ttg ggc ttt ccc 
Ser Ser He Gin Thr Asp Glu Ala Arg His Ala Gin Leu Gly Phe Pro 
225 230 235 240 



acc etc gac gtg atg atg aag cac gac cec aag cgc gec cag cag ate 
Thr Leu Asp Val Met Met Lys His Asp Pro Lys Arg Ala Gin Gin He 

245 2 5o 255 

ctg gac gtc gee ttc tgg cgc tec tac cgc ate ttc cag gcg gtg acc 
Leu Asp Val Ala Phe Trp Arg Ser Tyr Arg He Phe Gin Ala Val ?hr 

260 265 270 

ggc gtc tec atg gac tac tac acg ccg gtc gee aag egg cag atg teg 
Gly Val Ser Met Asp Tyr Tyr Thr Pro Val Ala Lys Arg Gin Met Ser 
275 280 285 

ttc aag gag ttc atg ctg gag tgg ate gtc aag cat cat gag cgc ate 
Phe Lys Glu Phe Met Leu Glu Trp He Val Lys His His Glu Arg He 
290 295 



300 



365 



528 



624 



672 



720 



768 



816 



864 



912 



1008 



ctg cgc gac tac ggc ctc cag aag ccc tgg tac tgg gac acg ttc gag 9 60 

Leu Arg Asp Tyr Gly Leu Gin Lys Pro Trp Tyr Trp Asp Thr Phe Glu 
305 310 315 320 

f ag r tC ? at Ca ° ggC Cac cac gcg ct S cac ate ggc acc tgg ttc 

Lys Thr Leu Asp His Gly His His Ala Leu His He Gly Thr Trp Phe 

325 330 335 

tgg cgc ccg acc ctg ttc .tgg gat ccc aat ggc ggc gtc teg cgc gag 
Trp Arg Pro Thr Leu Phe Trp Asp Pro Asn Gly Gly Val Ser Arg Glu 

34a 345 " 350 

gag egg cgc tgg ctg aac cag aag tat ccg aac tgg gaa gag age tgg 
Glu Arg Arg Trp Leu Asn Gin Lys Tyr Pro Asn Trp Glu Glu Ser Trp 
355 360 



1056 



1104 



ri?r u ? r 9 9 gag atC atC tCC aac atC aat gcg 9<? c aac att 1152 

Gly Val Leu Trp Asp Glu He He Ser Asn He Asn Ala Gly Asn He 

370 375 380 

rf» f/* S? C r tg S CC gag aCg Ctg ccg atg ctg tgc aac 9 tc ac c aac 1200 
Glu Lys Thr Leu Pro Glu Thr Leu Pro Met Leu Cys Asn Val Thr Asn 

385 3 90 395 400 

ctg ccc ate ggc teg cac tgg gac cgc ttc cac ctg aag ccc gag cag 1248 
Leu Pro He Gly Ser His Trp Asp Arg Phe His Leu Lys Pro Glu Gin 

405 410 415 

etc gtc tac aag ggg egg ctc tac acc ttc gac age gac gtc tec aag 1296 
Leu Val Tyr Lys Gly Arg Leu Tyr Thr Phe Asp Ser Asp Val Ser Lvs 

420 425 430 

tgg ate ttc gag ctc gat ccg gag cgc tat gec ggc cac acc aac gtg 1344 
Trp He Phe Glu Leu Asp Pro Glu Arg Tyr Ala Gly His Thr Asn Val 
435 440 445 

gtc gac cgc ttc ate ggc ggg cag ate cag ccc atg acc ate gag ggc 1392 



50 



Val Asp Arg Phe He Gly Gly Gin He Gin Pro Met Thr He Glu Gly 
450 455 460 

rri-n r-tc aac taa atg ggc ctg acg ccc gaa gtc atg ggc aag gac gtg 
Val Leu Asn Trp Me? Gly Leu Thr Pro Glu Val Met Gly Lys Asp Val 

470 475 i±ou 



465 



ttc aac tac cgt tgg gcc ggc gat tac gcc gag aac egg ate gcc gcc 
Phe Asn Tyr Arg Trp Ala Gly Asp Tyr Ala Glu Asn Arg He Ala Ala 



485 490 



gag taa 
Glu 



<210> 14 

<211> 497 

<212> PRT 

<213> Xanthobacta sp 



<400> 14 

Met Ala Leu Leu Asn Arg Asp Asp Trp Tyr Asp He Ala Arg Asp Val 



1 



Asp Trp Thr Leu Ser Tyr Val Asp Arg Ala Val Ala Phe Pro Glu Glu 



20 25 

Trp Lys Gly Glu Lys Asp He Cys Gly Thr Ala Trp Asp Asp Trp Asp 
35 40 45 

Phe Arg Val Ser Phe Arg Glu Tyr Val Met Val Gin Arg Asp 

55 60 



Glu Pro 
50 



Lys Glu Ala Ser Val Gly Ala He Arg Glu Ala Met Val Arg Ala Lys 
65 ™ 75 

Ala Tyr Glu Lys Leu Asp Asp Gly His Lys Ala Thr Ser His Leu His 



85 



Met Gly Thr He Thr Met Val Glu His Met Ala Val Thr Met Gin Ser 

100 105 

Arg Phe Val Arg Phe Ala Pro Ser Ala Arg Trp Arg Ser Leu Gly Ala 
115 120 125 

Phe Gly Met Leu Asp Glu Thr Arg His Thr Gin Leu Asp Leu Arg Phe 
130 135 140 



Ser His Asp Leu Leu Asn Asp Ser Pro Ser Phe Asp Trp Ser Gin Arg 
145 150 155 



Ala Phe His Thr Asp Glu Trp Ala Val Leu Ala Thr Arg Asn Leu Phe 

165 l" 70 

Asp Asp He Met Leu Asn Ala Asp Cys Val Glu Ala Ala Leu Ala Thr 

180 XS5 

Ser Leu Thr Leu Glu His Gly Phe Thr Asn He Gin Phe Val Ala Leu 
195 200 2 °5 

Ala Ser Asp Ala Met Glu Ala Gly Asp Val Asn Phe Ser Asn Leu Leu 
215 220 



1440 



1488 



1494 



Ser Ser He Gin Thr Asp Glu Ala Arg His Ala Gin Leu Gly Phe Pro 
225 230 235 

Thr Leu Asp Val Met Met Lys His Asp Pro Lys Arg Ala Gin Gin He 



245 250 

Leu Asp val Ala Phe .Trp Arg Ser Tyr Arg He Phe Gin Ala Val Thr 

2 60 265 ~ 



Gly Val Ser Met Asp Tyr Tyr Thr Pro Val Ala Lys Arg Gin Met Ser 

280 285 



210 



275 



51 



Phe Lys Glu Phe Met Leu Glu Trp lie Val Lys His His Glu Arg lie 
290 295 300 

Leu Arg Asp Tyr Gly Leu Gin Lys Pro Trp Tyr Trp Asp Thr Phe Glu 
305 310 315 320 

Lys Thr Leu Asp His Gly His His Ala Leu His lie Gly Thr Trp Phe 

325 330 335 

Trp Arg Pro Thr Leu Phe Trp Asp Pro Asn Gly Gly Val Ser Arg Glu 

340 345 35 0 

Glu Arg Arg Trp Leu Asn Gin Lys Tyr Pro Asn Trp Glu Glu Ser Trp 
355 360 365 

Gly Val Leu Trp Asp Glu He He Ser Asn He Asn Ala Gly Asn He 
370 375 380 

Glu Lys Thr Leu Pro Glu Thr Leu Pro Met Leu Cys Asn Val Thr Asn 
385 390 395 400 

Leu Pro He Gly Ser His Trp Asp Arg Phe His Leu Lys Pro Glu Gin 

405 410 415 

Leu Val Tyr Lys Gly Arg Leu Tyr Thr Phe Asp Ser Asp Val Ser Lys 

420 425 430 

Trp He Phe Glu Leu Asp Pro Glu Arg Tyr Ala Gly His Thr Asn Val 
435 440 445 

Val Asp Arg Phe He Gly Gly Gin He Gin Pro Met Thr He Glu Gly 
450 455 460 

Val Leu Asn Trp Met Gly Leu Thr Pro Glu Val Met Gly Lys Asp Val 
465 470 475 • 480 

Phe Asn Tyr Arg Trp Ala Gly Asp Tyr Ala Glu Asn Arg He Ala Ala 

485 490 495 

Glu 



<210> 15 

<211> 1026 

<212> DNA 

<213> Xanthobacta sp. 



<220> 

<221> CDS 

<222> (1) . . (1026) 



<400> 15 

atg aca cag cag cgc ccc acc cgc acg cgc gag cgc aag aag acc tgg 
Met Thr Gin Gin Arg Pro Thr Arg Thr Arg Glu Arg Lys Lys Thr Trp 
15 10 15 

acg get ttc ggc aat etc gga cgc aag ccg acc gac tac gag gtc gtc 
Thr Ala Phe Gly Asn Leu Gly Arg Lys Pro Thr Asp Tyr Glu Val Val 

20 25 30 

acc cac aac atg aac cac acc atg cgc ggc acg ccc ctg gag ctg teg 
Thr His Asn Met Asn His Thr Met Arg Gly Thr Pro Leu Glu Leu Ser 
35 40 45 

ccg acg gtg cac gec aat gtg tgg etc aag aag aac cgc gac gag ate 
Pro Thr Val His Ala Asn Val Trp Leu Lys Lys Asn Arg Asp Glu He 
50 55 ~ 60 

gcg etc aag gtc gac age tgg gat ctg ttc cgc gat ccc gac cgc acc 
Ala Leu Lys Val Asp Ser Trp Asp Leu Phe Arg Asp Pro Asp Arg Thr 
65 70 ,75 * 80 

acc tac gac acc tac gtc aag atg cag gac gac cag gag acc tat gtc 



52 



Thr Tyr Asp Thr Tyr Val Lys Met Gin Asp Asp Gin Glu Thr Tyr Val 

85 90 95 



gac aac ctg etc ctg tec tac acc ggc gag ggc cgc tac gac gag gag 
Asp Asn Leu Leu Leu Ser Tyr Thr Gly Glu Gly Arg Tyr Asp Glu Glu 

100 105 H° 

ctt tec teg cgc age etc gac etc ctg tec gcg ggg ctg acg ccg acc 
Leu Ser Sel Arg Ser Leu Asp Leu Leu Ser Ala Gly Leu Thr Pro Thr 

120 125 



115 



cgc tat ctg ggc cat ggg ctg cag atg etc gcg gee. tat ate cag cag 
Arg Tyr Leu Gly His Gly Leu Gin Met Leu Ala Ala Tyr He Gin Gin 
130 135 l^O 

etc gee ccg teg gee tat gtg ggc aat tgc gcg gtg ttc cag acc tec 
Leu Ala Pro Ser La Tyr Val Gly Asn Cys Ala Val Phe Gin Thr Ser 
' 150 155 lbU 



145 



gac gcg ctg cgc. cgc gtg cag cgc gtc gee tac cgc acc cgc cag etc 
Isp Ala Leu Arg Arg Val Gin Arg Val Ala Tyr Arg Thr Arg Gin Leu 

«L 7 0 



165 



gee gac gec cat ccg gec cgc ggc ttc ggc tec ggc gac egg gcg gtg 
La Lp Ala His Pro Ala Arg Gly Phe Gly Ser Gly Asp Arg Ala Val 
' " 180 185 190 



tgg gag aag tec ccg gac tgg cag ccc ate cgc aag gec ate gag gag 
Tro Glu Lys Ser Pro Asp Trp Gin Pro He Arg Lys Ala He Glu Glu 

200 205 



195 



ctq etc gtc acc ttc gaa tgg gac aag gcg etc gee ggc acc aat ttc 
Leu Leu Val Thr Phe Glu Trp Asp Lys Ala Leu Ala Gly Thr Asn Phe 
" 215 220 



210 



atg gtg aag ccg ate etc gac gag ctg ttc etc aac cac ctg gcg cgc 
val Val Lys Pro He Leu Asp Glu Leu Phe Leu Asn His Leu Ala Arg 
225 230 235 

ctg etc cac gtg gag ggc gac gag etc gac age etc gtg ctg egg aac 
Leu Leu His Val Glu Gly Asp Glu Leu Asp Ser Leu Val Leu Arg Asn 

245 250 255 

ctt cac ggc gac gee cag cgc cae gec cgc tgg acg gee gcg etc ggc 
Leu His Gly Asp Ala Gin Arg His Ala Arg Trp Thr Ala Ala Leu Gly 

260 265 270 

cqc ttc gec gtc gag cag aac gtg aac aac cgc acg gtc ctg cgc gac 
Arc Pne La Val Glu Gin Asn Val Asn Asn Arg Thr Val Leu Arg Asp 

280 285 



275 



gee ate gec ggc tgg cac gag acc ggc gag gcg gtc etc gec gcg ggc 
La lie Ala Gly Trp His Glu Thr Gly Glu Ala Val Leu Ala Ala Gly 
290 295 3°° 

gee ggg atg ctt gcg age cgc gee ccc age gcg gat gcg gco aag ate 
La Gly Met Leu Ala Ser Arg Ala Pro Ser Ala Asp Ala Ala Lys lie 
305 310 315 

gee gac gag gtc cgc gec acg etc gcg cag ctg cae gee aat gcg ggc 

Arg 
325 



gec gac gag gtc cgc gee acg etc gcg cag uuy — = 

Ita Isp Glu Val Arg Ala Thr Leu Ala Gin Leu His Ala Asn Ala Gly 

330 



etc ggg cac gat gee tga 
Leu Gly His Asp Ala 



340 



<210> 16 

<211> 341 

<212> PRT 

<213> Xanthobacta sp 



<400> 16 

Met Thr Gin Gin Arg Pro Thr Arg Thr Arg Glu Arg Lys Lys Thr Trp 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1026 



h 



53 



15 10 15 

Thr Ala Phe Gly Asn Leu Gly Arg Lys Pro Thr Asp Tyr Glu Val Val 

20 25 30 

Thr His Asn Met Asn His Thr Met Arg Gly Thr Pro Leu Glu Leu Ser 
35 40 45 

Pro Thr Val His Ala Asn Val Trp Leu Lys Lys Asn Arg Asp Glu lie 
50 55 60 

Ala Leu Lys Val Asp Ser Trp Asp Leu Phe Arg Asp Pro Asp Arg Thr 
65 70 75 80 

Thr Tyr Asp Thr Tyr Val Lys Met Gin Asp Asp Gin Glu Thr Tyr Val 

85 90 95 

Asp Asn Leu Leu Leu Ser Tyr Thr Gly Glu Gly Arg Tyr Asp Glu Glu 

100 105 . 110 

Leu Ser Ser Arg Ser Leu Asp Leu Leu Ser Ala Gly Leu Thr Pro Thr 
115 120 "* 125 

Arg Tyr Leu Gly His Gly Leu Gin Met Leu Ala Ala Tyr lie Gin Gin 
130 135 140 

Leu Ala Pro Ser Ala Tyr Val Gly Asn Cys Ala Val Phe Gin Thr Ser 
145 150 155 160 

Asp Ala Leu Arg Arg Val Gin Arg Val Ala Tyr Arg Thr Arg Gin Leu 

165 170 175 

Ala Asp Ala His Pro 'Ala Arg Gly Phe Gly Ser Gly Asp Arg Ala Val 

180 185 ' 190 

Trp Glu Lys Ser Pro" Asp Trp Gin Pro lie Arg Lys Ala lie Glu Glu 
195 200 . ' 205 

Leu Leu Val Thr Phe Glu Trp Asp Lys Ala Leu Ala Gly Thr Asn Phe 
210 215 220 

Val Val Lys Pro lie Leu Asp Glu Leu Phe Leu Asn His Leu Ala Arg 
225 230 235 240 

Leu Leu His Val Glu Gly Asp Glu Leu Asp Ser Leu Val Leu Arg .Asn 

245 250 255 

Leu His Gly Asp Ala Gin Arg His Ala Arg Trp Thr Ala Ala Leu Gly 

260 265 270 

Arg Phe Ala Val Glu Gin Asn Val Asn Asn Arg Thr Val Leu Arg Asp 
275 280 285 

Ala He Ala Gly Trp His Glu Thr Gly Glu Ala Val Leu Ala Ala Gly 
290 295 300 

Ala Gly Met Leu Ala Ser Arg Ala Pro Ser Ala Asp Ala Ala Lys He 
305 310 315 "* 320 

Ala Asp Glu Val Arg Ala Thr Leu Ala Gin Leu His Ala Asn Ala Gly 

325 330 335 

Leu Gly His Asp Ala 

340 



<210> 17 

<211> 267 

<212> DNA 

<213> Xanthobacta sp. 



<220> 
<221> 
<222> 



CDS 

(1) . . (267) 



54 



gcc acc ctg gag gcc ate atg. acc aag cgc gag atg ctg ccc ctg cag 

Ala Thr Leu Glu Ala He Met Thr Lys Arg Glu Met Leu Pro Leu Gin 

65 70 75 80 

tgg ttc gac gtg agg ttc aag aag tga 

Trp Phe Asp Val Arg Phe Lys Lys 

85 



48 



96 



<400> 17 

atg tct ttg ttc ccc ate gtg ggc cgc ttc gtg ggg gat ttc gtc ccc 

Met Ser Leu Phe Pro He Val Gly Arg Phe Val Gly Asp Phe Val Pro 
1 5 10 15 

cac ctg gtg gcg gtg gac acc tct gac acc ate gat cag ate gcc gag 

His Leu Val Ala Val Asp Thr Ser Asp Thr He Asp Gin He Ala Glu 

20 25 30 

aag gtg gcg gtc cac acg gtc ggg egg cgc ttg ccg ccc gat ccc acc 

Lys Val Ala Val His Thr Val Gly Arg Arg Leu Pro Pro Asp Pro Thr 

35 40 45 

gcc acc ggc tat gag gtg etc etc gac ggc gag acc ctg gac ggg ggc 192 
Ala Thr Gly Tyr Glu Val Leu Leu Asp Gly Glu Thr Leu Asp Gly Gly 
50 55 60 



144 



240 



267 



<210> 18 

<211> 88 

<212> PRT 

<213> Xanthobacta sp. 

<400> 18 

Met Ser Leu Phe Pro He Val Gly Arg Phe Val Gly Asp Phe Val Pro 
I 5 10 .15 

His Leu Val Ala Val Asp Thr Ser Asp Thr He Asp Gin He Ala Glu 

20 25 30 

Lys Val Ala Val His Thr Val Gly Arg Arg Leu Pro Pro Asp Pro Thr 
35 40 45 

Ala Thr Gly Tyr Glu Val Leu Leu Asp Gly Glu Thr Leu Asp Gly Gly 
50 55 60 

Ala Thr Leu Glu Ala He Met Thr Lys Arg Glu Met Leu Pro Leu Gin 
65 70 75 80 

Trp Phe Asp Val Arg Phe Lys Lys 

85 



<210> 19 

<211> 1584 

<212> DNA 

<213> Methylococcus capsulatas 



<220> 

<221> CDS 

<222> (1)..(1584) 

<400> 19 

atg gca ctt age acc gca acc aag gcc gcg acg gac gcg ctg get gcc 

Met Ala Leu Ser Thr Ala Thr Lys Ala Ala Thr Asp Ala Leu Ala Ala 
1 5 10 15 

aat egg gca ccc acc age gtg aat gca cag gaa gtg cac cgt tgg etc 
Asn Arg Ala Pro Thr Ser Val Asn Ala Gin Glu Val His Arg Trp Leu 

20 25 30 

cag age ttc aac tgg gat ttc aag aac aac egg acc aag tac gcc acc 
Gin Ser Phe Asn Trp Asp Phe Lys Asn Asn Arg Thr Lys Tyr Ala Thr 
35 40 45 

aag tac aag atg gcg aac gag acc aag gaa cag ttc aag ctg ate gcc 



48 



96 



144 



192 



55 



Lys Tyr Lys Met Ala Asn Glu Thr Lys Glu Gin Phe Lys Leu He Ala 
50 55 60 

aag gaa tat gcg cgc atg gag gca gtc aag gac gaa agg cag ttc ggt 
Lys Glu Tyr Ala Arg Met Glu Ala Val Lys Asp Glu Arg Gin Phe Gly 
65 70 75 80 

age ctg cag gat gcg ctg acc cgc etc aac gcc ggt gtt cgc gtt cat 
Ser Leu Gin Asp Ala Leu Thr Arg Leu Asn Ala Gly Val Arg Val His 

85 90 ~ 95 

ceg aag tgg aac gag acc atg aaa gtg gtt teg aac ttc ctg gaa gtg 
Pro Lys Trp Asn Glu Thr Met Lys Val Val Ser Asn Phe Leu Glu Val 

100 105 HO 

ggc gaa tac aac gcc ate gcc get acc ggg atg ctg tgg gat tec gcc 
Gly Glu Tyr Asn Ala He Ala Ala Thr Gly Met Leu Trp Asp Ser Ala 
115 120 125 

cag gcg gcg gaa cag aag aac ggc tat ctg gcc cag gtg ttg gat gaa 
Gin Ala Ala Glu Gin Lys Asn Gly Tyr Leu Ala Gin Val Leu Asp Glu 
130 135 140 

ate cgc cac acc cac cag tgt gcc tac gtc aac tac tac ttc gcg aag 
He Arg His Thr His Gin Cys Ala Tyr Val Asn Tyr Tyr Phe Ala Lys 
i45 150 155 " 160 

aac ggc cag gac ceg gcc ggt cac aac gat get cgc cgc acc cgt acc 
Asn Gly Gin Asp Pro Ala Gly His Asn Asp Ala Arg Arg Thr Arg Thr 

165 170 " 175 

ate ggt ceg ctg tgg aag ggc atg aag cgc gtg ttt tec gac ggc ttc 
He Gly Pro Leu Trp Lys Gly Met Lys Arg Val Phe Ser Asp Gly Phe 

180 185 190 

att tec ggc gac gcc gtg gaa tgc tec etc aac ctg cag ctg gtg ggt 
He Ser Gly Asp Ala Val Glu Cys Ser Leu Asn Leu Gin Leu Val Gly 
195 ' 200 205 

gag gcc tgc ttc acc aat ceg ctg ate gtc gca gtg acc gaa tgg get 
Glu Ala Cys Phe Thr Asn Pro Leu He Val Ala Val Thr Glu Trp Ala 
210 215 220 

gcc gcc .aac ggc gat gaa ate acc ceg acg gtg ttc ctg teg ate gag 
Ala Ala Asn Gly Asp Glu He Thr Pro Thr Val Phe Leu Ser He Glu 
225 230 235 240 

acc gac gaa ctg cgc cac atg gcc aac ggt tac cag acc gtc gtt tec 
Thr Asp Glu Leu Arg His Met Ala Asn Gly Tyr Gin Thr Val Val Ser 

245 250 255 

ate gcc aac gat ceg get tec gcc aag tat etc aac acg gac ctg aac 
He Ala Asn Asp Pro Ala Ser Ala Lys Tyr Leu Asn Thr Asp Leu Asn 

260 265 270 

aac gcc ttc tgg acc cag cag aag tac ttc acg ceg gtg ttg ggc atg 
Asn Ala Phe Trp Thr Gin Gin Lys Tyr Phe Thr Pro Val Leu Gly Met 
275 280 285 

ctg ttc gag tat ggc tec aag ttc aag gtc gag ceg tgg gtc aag acg 
Leu Phe Glu Tyr Gly Ser Lys Phe Lys Val Glu Pro Trp Val Lys Thr 
290 295 300 

tgg gac cgc tgg gtg tac gag gac tgg ggc ggc ate tgg ate ggc cgt 
Trp Asp Arg Trp Val Tyr Glu Asp Trp Gly Gly He Trp He Gly Arg 
305 310 315 320 

ctg ggc aag tac ggg gtg gag teg ceg cgc age etc aag gac gcc aag 
Leu Gly Lys Tyr Gly Val Glu Ser Pro Arg Ser Leu Lys Asp Ala Lys 

325 330 335 

cag gac get tac tgg get cac cac gac ctg tat ctg ctg get tat gcg 
Gin Asp Ala Tyr Trp Ala His His Asp Leu Tyr Leu Leu Ala Tyr Ala 

■ 340 345 ^ 350 

ctg tgg ceg acc ggc ttc ttc cgt ctg gcg ctg ceg gat cag gaa gaa 



56 



Leu Trp Pro Thr Gly Phe Phe Arg Leu Ala Leu Pro Asp Gin Glu Glu 
355 360 365 

atg gag tgg ttc gag gcc aac tac ccc ggc tgg tac gac cac tac ggc 
Met Glu Trp Phe Glu Ala Asn Tyr Pro Gly Trp Tyr Asp His Tyr Gly 
370 ^ 375 . 380 



aag ate tac gag gaa tgg cgc gcc cgc ggt tgc gag gat ccg tec teg 
Lvs lie Tyr Glu Glu Trp Arg Ala Arg Gly Cys Glu Asp Pro Ser Ser 
385 390 395 400 



ggc ttc ate ccg ctg atg tgg ttc ate gaa aac. aac cat ccc ate tac 
Glv Phe lie Pro Leu Met Trp Phe He Glu Asn Asn Hxs Pro He Tyr 

405 410 415 

ate gat cgc gtg teg caa gtg ccg ttc tgc ccg age ttg gcc aag ggc 
He Asp Arg Val Ser Gin Val Pro Phe Cys Pro Ser Leu Ala Lys Gly 
. 420 425 430 

gcc age acc ctg cgc gtg cac gag tac aac ggc gag atg cac ace ttc 
Ala Ser Thr Leu Arg Val His Glu Tyr Asn Gly Glu Met Hxs Thr Phe 
435 440 445 

age gac cag tgg ggc gag cgc atg tgg ctg gcc gag ccg gag cgc tac 
Ser Asp Gin Trp Gly Glu Arg Met Trp Leu Ala Glu Pro Glu Arg Tyr 



450 



455 460 



gag tgc cag aac ate ttc gaa cag tac gaa gga cgc gaa ctg teg gaa 
Glu Cys Gin Asn He Phe Glu Gin Tyr Glu Gly Arg Glu Leu Ser Glu 
465 470 475 480 

gtg ate gcc gaa ctg cac ggg ctg cgc agt gat ggc aag acc ctg ate 
val He Ala Glu Leu His Gly Leu Arg Ser Asp Gly Lys Thr Leu He 

485 490 495 

gcc cag ccg cat gtc cgt ggc gac aag ctg tgg acg ttg gac gat ate 
Ala Gin Pro His Val Arg Gly Asp Lys Leu Trp Thr Leu Asp Asp He 

500 505 510 

aaa cgc ctg aac tgc gtc ttc aag aac ccg gtg aag gca ttc aat tga 
Lys Arg Leu Asn Cys Val Phe Lys Asn Pro Val Lys Ala Phe Asn 
515 520 525 



<210> 20 

<211> 527 

<212> PRT 

<213> Methylococcus capsulatas 

<400> 20 

Met Ala Leu Ser Thr Ala Thr Lys Ala Ala Thr Asp Ala Leu Ala Ala 
1 5 10 15 

Asn Arg Ala Pro Thr Ser Val Asn Ala Gin Glu Val His Arg Trp Leu 

20 25 30 

Gin Ser Phe Asn Trp Asp Phe Lys Asn Asn Arg Thr Lys Tyr Ala Thr 
35 40 45 

Lys Tyr Lys Met Ala Asn Glu Thr Lys Glu Gin Phe Lys Leu He Ala 
50 55 60 

Lys Glu Tyr Ala Arg Met Glu Ala Val Lys Asp Glu Arg Gin Phe Gly 
65 70 75 80 

Ser Leu Gin Asp Ala Leu- Thr Arg Leu Asn Ala Gly Val Arg Val His 

85 9° 95 

Pro Lys Trp Asn Glu Thr Met Lys Val Val Ser Asn Phe Leu Glu Val 

1Q0 105 11° 

Gly Glu Tyr Asn Ala He Ala Ala Thr Gly Met Leu Trp Asp Ser Ala 
115 120 125 

Gin Ala Ala Glu Gin Lys Asn Gly Tyr Leu Ala Gin Val Leu Asp Glu 



1152 



1200 



1248 



1296 



1344 



1392 



1440 



1488 



1536 



1584 



57 



130 135 140 

He Arg His Thr His Gin Cys Ala Tyr Val Asn Tyr Tyr Phe Ala Lys 
145 150 155 160 

Asn Gly Gin Asp Pro Ala Gly His Asn Asp Ala Arg Arg Thr Arg Thr 

165 170 ~ 175 

He Gly Pro Leu Trp Lys Gly Met Lys Arg Val Phe Ser Asp Gly Phe 

180 185 190 

He Ser Gly Asp Ala Val Glu Cys Ser Leu Asn Leu Gin Leu Val Gly 
195 200 205 

Glu Ala Cys Phe Thr Asn Pro Leu He Val Ala Val Thr Glu Trp Ala 
210 215 220 

Ala Ala Asn Gly Asp Glu He Thr Pro Thr Val Phe Leu Ser He Glu 
225 230 235 240 

Thr Asp Glu Leu Arg His Met Ala Asn Gly Tyr Gin Thr Val Val Ser 

245 250 255 

He Ala Asn Asp Pro Ala Ser Ala Lys Tyr Leu Asn Thr Asp Leu Asn 

260 265 270 

Asn Ala Phe Trp Thr Gin Gin Lys Tyr Phe Thr Pro Val Leu Gly Met 
275 280 285 

Leu Phe Glu Tyr Gly Ser Lys Phe Lys Val Glu Pro Trp Val Lys Thr 
290 295 300 

Trp Asp Arg Trp Val Tyr Glu Asp Trp Gly Gly He Trp He Gly Arg 
305 310 315 ~ 320 

Leu Gly Lys Tyr Gly Val Glu Ser Pro Arg Ser Leu Lys Asp Ala Lys 

325 330 ' 335 

Gin Asp Ala Tyr Trp Ala His His Asp Leu Tyr Leu Leu Ala Tyr Ala 

340 345 350 

Leu Trp Pro Thr Gly Phe Phe Arg Leu Ala Leu Pro Asp Gin Glu Glu 
355 360 365 

Met Glu Trp Phe Glu Ala Asn Tyr Pro Gly Trp Tyr Asp His Tyr Gly 
370 375 " ~ 380 ' 

Lys He Tyr Glu Glu Trp Arg Ala Arg Gly Cys Glu Asp Pro Ser Ser 
385 390 395 400 

Gly Phe He Pro Leu Met Trp Phe lie Glu Asn Asn His Pro He Tyr 

405 410 415 

He Asp Arg Val Ser Gin Val Pro Phe Cys Pro Ser Leu Ala Lys Gly 

420 425 430 

Ala Ser Thr Leu Arg Val His Glu Tyr Asn Gly Glu Met His Thr Phe 
435 440 ' 445 

Ser Asp Gin Trp Gly Glu Arg Met Trp Leu Ala Glu Pro Glu Arg Tyr 
450 455 460 

Glu Cys Gin Asn He Phe Glu Gin .Tyr Glu Gly Arg Glu Leu Ser Glu 
465 470 475 480 

Val He Ala Glu Leu His Gly Leu Arg Ser Asp Gly Lys Thr Leu He 

485 490 495 

Ala Gin Pro His Val Arg Gly Asp Lys Leu Trp Thr Leu Asp Asp He 

500 505 510 

Lys Arg Leu Asn Cys Val Phe Lys Asn Pro Val Lys Ala Phe Asn 
515 520 525 



<210> 21 



58 



<211> 1170 
<212> DNA 

<213> Methylococcus capsulatas 



<220> 

<221> CDS 

<222> (1) . . (1170) 



<400> 21 

atg age atg tta gga gaa aga cgc cgc ggt ctg acc gat ccg gaa atg 
Met Ser Met Leu Gly Glu Arg Arg Arg Gly Leu Thr Asp Pro Glu Met 
1 5 10 15 

gcg gec gtc att ttg aag gcg ctt cct gaa get ccg ctg gac ggc aac 
Ala Ala Val lie Leu Lys Ala Leu Pro Glu Ala Pro Leu Asp Gly Asn 

20 25 30 

aac aag atg ggt tat ttc gtc acc ccc cgc tgg aaa cgc ttg acg gaa 
Asn Lys Met Gly Tyr Phe Val Thr Pro Arg Trp Lys Arg Leu Thr Glu 
35 40 45 

tat gaa gec ctg acc gtt tat gcg cag ccc aac gec gac tgg ate gee 
Tyr Glu Ala Leu Thr Val Tyr Ala Gin Pro Asn Ala Asp Trp He Ala 
50 55 . 60 

ggc ggc ctg gac tgg ggc gac tgg acc cag aaa ttc cac ggc ggc cgc 
Gly Gly Leu Asp Trp Gly Asp Trp Thr Gin Lys Phe His Gly Gly Arg 
65 70 75 80 

cct tec tgg ggc aac gag acc acg gag ctg cgc acc gtc gac tgg ttc 
Pro Ser Trp Gly Asn Glu Thr Thr Glu Leu Arg Thr Val Asp Trp Phe 

85 90 95 



aag gee gag gaa tgg cgc tac acc gac cgc ttc ctg cag ggt tac tec 
Lys Ala Glu Glu Trp Arg Tyr Thr Asp Arg Phe Leu Gin Gly Tyr Ser 
115 120 125 

gec gac ggt cag ate egg gcg atg aac ccg acc tgg egg gac gag ttc 
Ala Asp Gly Gin He Arg Ala Met Asn Pro Thr Trp Arg Asp Glu Phe 
130 135 140 

ate aac egg tat tgg ggc gee ttc ctg ttc aac gaa tac gga ttg ttc 
He Asn Arg Tyr Trp Gly Ala Phe Leu Phe Asn Glu Tyr Gly Leu Phe 
145 150 155 160 

aac get cat teg cag ggc gee egg gag gcg ctg teg gac gta acc cgc 
Asn Ala His Ser Gin Gly Ala Arg Glu Ala Leu Ser Asp Val Thr Arg 

165 170 175 

gtc age ctg get ttc tgg ggc ttc gac aag ate gac ate gee cag atg 
Val Ser Leu Ala Phe Trp Gly Phe Asp Lys He Asp He Ala Gin Met 

180 185 190 

ate caa etc gaa egg ggt ttc etc gee aag ate gta ccc ggt ttc gac 
He Gin Leu Glu Arg Gly Phe Leu Ala Lys He Val Pro Gly Phe Asp 
195 200 205 

gag tec aca gcg gtg ccg aag gee gaa tgg acg aac ggg gag gtc tac 
Glu Ser Thr Ala Val Pro Lys Ala Glu Trp Thr Asn Gly Glu Val Tyr 
210 215 220 

aag age gec cgt ctg gee gtg gaa ggg ctg tgg cag gag gtg ttc gac 
Lys Ser Ala Arg Leu Ala Val Glu Gly Leu Trp Gin Glu Val Phe Asp 
225 230 235 240 

tgg aac gag age get tte teg gtg cac gee gtc tat gac gcg ctg ttc 
Trp Asn Glu Ser Ala Phe Ser Val His Ala Val Tyr Asp Ala Leu Phe 

245 250 255 

ggt cag ttc gtc cgc cgc gag ttc ttt cag egg ctg get ccc cgc ttc 
Gly Gin Phe Val Arg Arg Glu Phe Phe Gin Arg Leu Ala Pro Arg Phe 



48 



96 



144 



192 



240 



288 



aag cac cgt gac ccg etc cgc cgt tgg cat gcg ccg tac gtc aag gac 33 6 

Lys His Arg Asp Pro Leu Arg Arg Trp His Ala Pro Tyr Val Lys Asp 

100 105 HO 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



59 



260 265 



270 



ggc gac aat ctg acg cca ttc ttc ate aac cag gec cag aca tac ttc 8 64 

Gly Asp Asn Leu Thr Pro Phe Phe lie Asn Gin Ala Gin Thr Tyr Phe 

275 „ . 280 285 

cag ate gec aag cag ggc gta cag gat ctg tat tac aac tgt ctg ggt 912 
Gin He Ala Lys Gin Gly Val Gin Asp Leu Tyr Tyr Asn Cys Leu Glv 
290 295 300 

gac gat ccg gag ttc age gat tac aac cgt acc gtg atg cgc aac tgg 9 60 

Asp Asp Pro Glu Phe Ser Asp Tyr Asn Arg Thr Val Met Arg Asn Trp 
305 315 320 

acc ggc aag tgg ctg gag ccc acg ate gee get ctg cgc gac ttc atg 1008 
Thr Gly Lys Trp Leu Glu Pro Thr He Ala Ala Leu Arg Asp Phe Met 

325 330 335 

ggg ctg ttt gcg aag ctg ccg gcg ggc acc act gac aag gaa gaa ate 1056 
Gly Leu Phe Ala Lys Leu Pro Ala Gly Thr Thr Asp Lys Glu Glu He 

340 345 * 350 

acc gcg tec ctg tac egg gtg gtc gac gac tgg ate gag gac tac gee 1104 
Thr Ala Ser Leu Tyr Arg Val Val Asp Asp Trp He Glu Asp Tyr Ala 
355 360 365 

age gcg ate gac ttc aag gcg gac cgc gat cag ate gtt aaa gcg gtt 1152 
Ser Ala He Asp Phe Lys Ala Asp Arg Asp Gin He Val Lys Ala Val 
370 375 380 



ctg gca gga ttg aaa taa 
Leu Ala Gly* Leu Lys 
385 



<210> 22 

<211> 389 

<212> PRT 

<213> Methylococcus capsulatas 



<400> 22 

* 

Met Ser Met Leu Gly Glu Arg Arg Arg Gly Leu Thr Asp Pro Glu Met 
15 10 15 

Ala Ala Val He Leu Lys Ala Leu Pro Glu Ala Pro Leu Asp Gly Asn 

20 - 25 30 

Asn Lys Met Gly Tyr Phe Val Thr Pro Arg Trp Lys Arg Leu Thr Glu 
35 40 45 

Tyr Glu Ala Leu Thr Val Tyr Ala Gin Pro Asn Ala Asp Trp He Ala 
50 55 60 

Gly Gly Leu Asp Trp Gly Asp Trp Thr Gin Lys Phe His Gly Gly Arg 
65 70 75 80 

Pro Ser Trp Gly Asn Glu Thr Thr Glu Leu Arg Thr Val Asp Trp Phe 

85 90 95 

Lys His Arg Asp Pro Leu Arg Arg Trp His Ala Pro Tyr Val Lys Asp 

100 105 ' no 

Lys Ala Glu Glu Trp Arg Tyr Thr Asp Arg Phe Leu Gin Gly Tyr Ser 
115 120 125 

Ala Asp Gly Gin He Arg Ala Met Asn Pro Thr Trp Arg Asp Glu Phe 
130 135 140 

He Asn Arg Tyr Trp Gly Ala Phe Leu Phe Asn Glu Tyr Gly Leu Phe 
145 150 . 155 ~ 160 

Asn Ala His Ser Gin Gly Ala Arg Glu Ala Leu Ser Asp Val Thr Arg 

•165 170 175 

Val Ser Leu Ala Phe Trp Gly Phe .Asp Lys He Asp He Ala Gin Met 



1170 



60 



180 185 



190 



He Gin Leu Glu Arg Gly Phe Leu Ala Lys He Val Pro Gly Phe Asp 
195 200 205 

Glu Ser Thr Ala Val Pro Lys Ala Glu Trp Thr Asn Gly Glu Val Tyr 
210 215 220 

Lys Ser Ala Arg Leu Ala Val Glu Gly Leu Trp Gin Glu Val Phe Asp 
225 230 235 240 

Trp Asn Glu Ser Ala Phe Ser Val His Ala Val Tyr Asp Ala Leu Phe 

245 250 255 

Gly Gin Phe Val Arg Arg Glu Phe Phe Gin Arg Leu Ala Pro Arg Phe 

260 265 270 

Gly Asp Asn Leu Thr Pro Phe Phe He Asn Gin Ala Gin Thr Tyr Phe 
275 280 285 

Gin He Ala Lys Gin Gly Val Gin Asp Leu Tyr Tyr Asn Cys Leu Gly 
290 295 300 

Asp Asp Pro Glu Phe Ser Asp Tyr Asn Arg Thr Val Met Arg Asn Trp 
305 310 315 32U 

Thr Gly Lys Trp Leu Glu Pro Thr He Ala Ala Leu Arg Asp Phe Met 

325 330 w 

Gly Leu Phe Ala Lys Leu Pro Ala Gly Thr Thr Asp Lys Glu Glu He 

340 345 350 

Thr Ala Ser Leu Tyr Arg Val Val Asp Asp Trp He Glu Asp Tyr Ala 
355 360 365 

Ser Ala He Asp Phe Lys Ala Asp Arg Asp Gin He Val Lys Ala Val 
370 375 380 

Leu Ala Gly Leu Lys 
385 



<210> 23 

<2H> 513 

<212> DNA 

<213> Methyl ococcus capsulatas 



<220> 

<221> CDS 

<222> (1) . . (513) 

ttl% C Q 2 laa ctg ggt ata cac age aac gac acc cgc gac gec tgg gtg 
Me? Ala Lys Leu Sy He His Ser Asn Asp Thr Arg Asp Ala Trp Val 
1 5 10 15 

aac aag ate gcg cag etc aac acc ctg gaa aaa gcg gee gag atg ctg 
Asn Lys lie 111 Gin Leu Asn Thr Leu Glu Lys Ala Ala Glu Met Leu 

20 25 30 

aaa caa ttc egg atg gac cac acc acg ccg ttc cgc aac age tac gaa 
Lys Gin Phe A^g Me? Asp His Thr Thr Pro Phe Arg Asn Ser Tyr Glu 
35 40 

ctg gac aac gac tac etc tgg ate gag gec aag etc gaa gag aag gtc 
Leu Asp Asn Asp Tyr Leu Trp He Glu Ala Lys Leu Glu Glu Lys Val 



50 



55 60 



acc ate etc aag gca cgc gee ttc aac gag gtg gac ttc cgt cat aag 
Ala Val Leu Lyl Ala Arg Ala Phe Asn Glu Val Asp Phe Arg Hrs Lys 

nn 75 ° u 

65 70 

ace get ttc ggc gag gat gee aag tec gtt ctg gac gge acc gtc gcg 
Thr Ala Phe Gly Glu Asp Ala Lys Ser Val Leu Asp Gly Thr Val Ala 

85 90 95 



48 



96 



144 



192 



240 



288 



61 



155 igo 



gtg egg gtg gtg cat ctg cag teg ccg cac tga 
Val Arg Val Val His Leu Gin Ser Pro His 

165 170 



<210> 24 

<211> 170 

<212> PRT 

<213> Methylococcus capsulatas 

<400> 24 



Met Ala Lys Leu Gly He His Ser Asn Asp Thr Arg Asp Ala Trp Val 
1 5 10 is 

Asn Lys lie Ala Gin Leu Asn Thr Leu Glu Lys Ala Ala Glu Met Leu 

20 25 30 

Lys Gin Phe Arg Met Asp His Thr Thr Pro Phe Arg Asn Ser Tyr Glu 
35 40 45 

Leu Asp Asn Asp Tyr Leu Trp lie Glu Ala Lys Leu Glu Glu Lys Val 
50 55 



. 60 



Ala Val Leu Lys Ala Arg Ala Phe Asn Glu Val Asp Phe Arg His Lys 

Thr Ala Phe Gly Glu Asp Ala Lys Ser Val Leu Asp Gly Thr Val Ala 

85 90 " 95 

Lys Met Asn Ala Ala Lys Asp Lys Trp Glu Ala Glu Lys He His He 

10 °. 105 4 no 

Gly Phe Arg Gin Ala Tyr Lys Pro Pro He Met Pro .Val Asn Tyr Phe 
115 120 125 

Leu Asp Gly Glu Arg Gin Leu Gly Thr Arg Leu Met Glu Leu Arg Asn 
130 135 140 

Leu Asn Tyr Tyr Asp Thr Pro Leu Glu Glu Leu Arg Lys Gin Arg Gly 
145 150 155 i 6 o 

Val Arg Val Val His Leu Gin Ser Pro His 

165 170 



336 



384 



rll m ? a ?? 9 aag gaC aag tgg gag *cg gag aag ate cat ate 

Lys Met Asn Ala Ala Lys Asp Lys Trp Glu Ala Glu Lys He His He 

100 105 no 

2?* ^° ? g ° °f g gCC tac aag ccg ccg atc at <? ccg gtg aac tat ttc 
Gly Phe Arg Gin Ala Tyr Lys Pro Pro He Met Pro Val Asn Tyr Phe 

115 120 125 

ctg gac ggc gag cgt cag ttg ggg acc egg ctg atg gaa ctg cgc aac 432 
Leu Asp Gly Glu Arg Gin Leu Gly Thr Arg Leu Me? Glu Leu A^g Asn 
130 ' 135 140 

T Ct , C £ aC gaC aCg CCg ctg gaa gaa ct( 3 cgc aaa cag cgc ggt 

Leu Asn Tyr Tyr Asp Thr Pro Leu Glu Glu Leu Arg Lys Gin Arg Gly 
14 5 150 



480 



513 



<210> 25 

<211> 1206 

<212> DNA 

<213> Pseudomonas oleovorans 



<220> 

<221> CDS 

<222> (1).. . (1206) 



<400> 25 

atg ctt gag aaa cac aga gtt ctg gat tec get cca gag tac gta gat 48 
Met Leu Glu Lys His Arg Val Leu Asp Ser Ala Pro Glu Tyr Val Asp 



62 



x 

aaa aag aaa tat etc tgg ata eta tea act ttg tgg ccg get act ccg 
Lys Lys Lys Tyr Leu Trp He Leu Ser Thr Leu Trp Pro Ala Thr Pro 

20 25 

*t.a ate aaa ate tgg ctt gca aat gaa act ggt tgg ggg att ttt tat 
nil He Gly He Trp Leu La Asn Glu Thr Oly Trp Sly He Phe Tyr 

35 40 , 

aaa etg gta ttg etc gta tgg tac ggc gca ctt eca ttg ctt gat gcg 
Gly Leu Val Leu Leu Val Trp Tyr Gly Ala Leu Pro Leu Leu Asp Ala 
50 55 60 

ata ttt ggt gag gac ttt aat aat ccg cct gaa gaa gtg gtg ccg aaa 
Mer P^e IS Lu Lp Phe Asn Asn Pro Pro Glu Glu Val Val Pro Lys 
65 ™ 75 

S III SI S 3 S S 5 IS 3 S S5 2 S f - 

s S S S K £ s s s s s s s ss s a 

eca atg tct tgg ctt gaa att ggt gcg ctt gec ttg tea etg ggt ate 
Pro Me? Ser Trp Leu Glu He Gly Ala Leu Ala Leu Ser Leu Gly He 
115 120 125 

„ra aac aaa eta gcg etc aat aca gga cac gaa etc ggt cac aag aag 
fl Asn Gly Leu 111 Leu Asn Thr Gly His Glu Leu Gly Hrs Lys Lys 
130 135 I 40 

gag act ttt gat cgt tgg atg gec aaa att gtg ttg get gte gta ggg 
Glu Thr Phe Asp Arg Trp Met Ala Lys He Val Leu Ala Val Val Gly 
145 I 50 " 

,.„ _ ot cac ttc ttt att gag cat aat aag ggt cat cac cgt gat gte 
Tyr gS H^s Phe Phe He Glu His Asn Lys Gly His His Arg Asp Val 

get aca ccg atg gat cct gca aca tee egg atg gga gaa age att tat 
Ala Thr Pro Met Asp Pro Ala Thr Ser Arg Met Gly Glu Ser lie lyr 

180 I 85 

aag ttt tea ate cgt gag ate eca gga gca ttt att cgt get tgg ggg 
Lys Phe Ser He Arg Glu He Pro Gly Ala Phe He Arg Ala Trp Gly 
195 200 

• ctt gag gaa eaa cgc ctt teg cgc cgt ggc caa age gtt tgg agt ttc 
Leu Glu Glu Gin Arg Leu Ser Arg Arg Gly Gin Ser Val Trp Ser Pne 
210 • 215 220 

gat aat gaa ate etc eaa eca atg ate ate aca gtt att ctt tac gee 
Asp Asn Glu He Leu Gin Pro Met He He Thr Val lie Leu xy 
225 230 235 

gtt etc ctt gee ttg ttt gga cet aag atg etg gtg ttc etg ccg att 
Val Leu Leu Ala Leu Phe Gly Pro Lys Met Leu Val Phe Leu Pro He 

245 250 

caa atg get ttc ggt tgg tgg cag etg acc agt gcg aac tat att gaa 
Gin Met Ala Phe Gly Trp Trp Gin Leu Thr Ser Ala Asn Tyr He Glu 

260 265 

rat tac aac ttg etc cgt caa aaa atg gag gac ggt cga tat gag cat 
His Tyr 25 ^eu Leu Arg Gin Lys Met Glu Asp Gly Arg Tyr Glu Hxs 
275 280 285 

caa aag ccg cac cat tct tgg aat agt aat cac ate gte tct aat eta 
Gin Lys Pro His His Ser Trp Asn Ser Asn His lie Val Ser Asn u 
290 295 300 

gtg etg ttc cac ctt cag egg cac teg gat cac cac gcg cat eca aca 
Va? Leu Phe His Leu Gin Arg His Ser Asp Hxs His Ala Has Pro Thr 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



63 



305 310 



315 320 



tl 2? C agt agt gct ggt cat agt tc ^ acc tct gcg gta gca 

Phe Gly Thr Ser Ser Ala Gly His Ser Ser Ser Thr Ser Ala Val Ala 
385 390 395 40Q 



teg tag 
Ser 



<210> 26 

<211> 401 

<212> PRT 

<213> Pseudomonas oleovorans 

<400> 26 

Met Leu Glu Lys His Arg Val Leu Asp Ser Ala Pro Glu Tyr Val Asp 
1 5 10 15 

Lys Lys Lys Tyr Leu Trp lie Leu Ser Thr Leu Trp Pro' Ala Thr Pro 

20 . 25 30 

Met lie Gly lie Trp Leu Ala Asn Glu Thr Gly Trp Gly He Phe Tyr 
35 40 45 

Gly Leu Val Leu Leu Val Trp Tyr Gly Ala Leu Pro Leu Leu Asp Ala 
50 55 6 o 

Met Phe Gly Glu Asp Phe Asn Asn Pro Pro Glu Glu Val Val Pro Lvs 
65 70 75 so 

Leu Glu Lys Glu Arg Tyr Tyr Arg Val Leu Thr Tyr Leu Thr Val Pro 

85 90 95 

Met His Tyr Ala Ala Leu He Val Ser Ala Trp Trp Val Gly Thr Gin 

100 105 no 

Pro Met Ser Trp Leu Glu He Gly Ala Leu Ala Leu Ser Leu Gly He 
H5 120 125 

Val Asn Gly Leu Ala Leu Asn Thr Gly His Glu Leu Gly His Lys Lys 
130 135 140 

Glu Thr Phe Asp Arg Trp Met Ala Lys He Val Leu Ala Val Val Gly 
145 150 155 leo 

Tyr Gly His Phe Phe He Glu His Asn Lys Gly His His Arg Asp Val 

165 ivo 175 

Ala Thr Pro Met Asp Pro Ala Thr Ser Arg Met Gly Glu Ser He Tyr 

180 i 8 5 19Q 

Lys Phe Ser He Arg Glu He Pro Gly Ala Phe He Arg Ala Trp Gly 
195 200 205 

Leu Glu Glu Gin Arg Leu Ser Arg Arg Gly Gin Ser Val Trp Ser Phe 
21° 215 220 



1056 



III T I ^ 9 Ctt ° gg gat tfct CCC g ^ c ct g c cg get ctt ccg 1008 

Arg Ser Tyr Gin Ser Leu Arg Asp Phe Pro Gly Leu Pro Ala Leu Pro 

325 330 335 

acg ggt tac cct ggt gca ttt ttg atg gcg atg att cct cag tgg ttt 
Thr Gly Tyr Pro Gly Ala Phe Leu Met Ala Met He Pro Gin Trp Phe 

340 345 350 

aga tea gtt atg gat ccc aag gta gta gat tgg get ggt ggt gac ctt 1104 
Arg Ser Val Met Asp Pro Lys Val Val Asp Trp Ala Gly Gly Asp Leu 
355 360 365 

aat aag ate caa att gat gat teg atg cga gaa acc tat ttg aaa aaa 1152 
Asn Lys He Gin He Asp Asp Ser Met Arg Glu Thr Tyr Leu Lys Lys 
370 375 380 



1200 



1206 



64 



Asp Asn Glu He 
225 



Val Leu Leu Ala 



Gin Met Ala Phe 

260 

His Tyr Gly Leu 
275 

Gin Lys Pro His 
290 

Val Leu Phe His 
305 

' Arg Ser Tyr Gin 



Thr Gly Tyr Pro 

340 

Arg Ser Val Met 
355 

Asn Lys He Gin 
370 

Phe Gly Thr Ser 
385 

Ser 



Leu Gin Pro Met 
230 



Leu Phe Gly Pro 
245 

Gly Trp Trp Gin 



Leu Arg Gin Lys 

280 



His Ser Trp Asn 
295 

Leu Gin Arg His 
310 

Ser Leu Arg Asp 
325 

Gly Ala Phe Leu 



Asp Pro Lys Val 

3 60 



He Asp Asp Ser 
375 

Ser Ala Gly His 
390 



He He Thr Val 
235 



Lys Met Leu Val 
250 

Leu Thr Ser Ala 
265 

Met Glu Asp Gly 



Ser Asn His He 

300 



Ser Asp His His 
315 

Phe Pro Gly Leu 
330 

Met Ala Met He 
345 

Val Asp Trp Ala 



Met Arg Glu Thr 

380 



Ser Ser Ser Thr 
395 



He Leu Tyr Ala 

240 



Phe Leu Pro He 
255 

Asn Tyr He Glu 
270 

Arg Tyr Glu His 
285 

Val Ser Asn Leu 



Ala His Pro Thr 

320 



pro Ala Leu Pro 
335 

Pro Gin Trp Phe 
350 

Gly Gly Asp Leu 
365 

Tyr Leu Lys Lys 



Ser Ala Val Ala 

400 



<210> 27 

<211> 1560 

<212> DNA 

<213> Burkholderia cepacia 



<220> 

<221> CDS 

<222> (1) • • (1560) 

<400> 27 



atg gac act tct gtg cag aag aag aaa etc ggt tta aag aat cgc tac 
Met Asp Thr Ser Val Gin Lys Lys Lys Leu Gly Leu Lys Asn Arg Tyr 
1 5 _ 1° 

gca gcg atg acc cgc ggt ctt ggc tgg cag acc age tac cag ccg atg 
La Ala Met Thr Arg Gly Leu Gly Trp Gin Thr Ser Tyr Gin Pro Met 



20 25 



gag aaa gtg ttt ccg tac gac aag tac gaa ggc ate aag ate cac gat 
Glu Lys Val Phe Pro Tyr Asp Lys Tyr Glu Gly He Lys He Ota Asp 
35 40 45 

S 5 "* S 5 S S K S S K C K S S 

5 5 



50 



aaa tat cag ggc gag aag gaa aaa aag ctt tac gee gtc ate gac get 
Lys Tyr Gin Gly Glu Lys Glu Lys Lys Leu Tyr Ala Val He Asp Ala 

70 75 



65 



ttc gcg cag aac aac ggg cag ttg age att tec gac gcg cga tat gtc 
Phe Ala Gin Asn Asn Gly Gin Leu Ser He Ser Asp Ala Arg Tyr Val 



85 9° 



aac gca etc aag gtg ttt ate cag ggt gtg aca ccg ttg gag tat atg 
Asn Ala Leu Lys Val Phe He Gin Gly Val Thr Pro Leu Glu Tyr Met 



100 i° 5 



48 



96 



144 



192 



240 



288 



336 



65 



gca cac cga ggt ttt gcc cac att ggt egg cat ttt acg ggt gaa ggg 38 4 

Ala His Arg Gly Phe Ala His He Gly Arg His Phe Thr Gly Glu Gly 
115 120 125 ' 

gca cgt gtt get tgc cag atg cag tec ate gac gag ctg cgt cac ttc 432 
Ala Arg Val Ala Cys Gin Met Gin Ser He Asp Glu Leu Arg His Phe 
130 135 140 

cag ace gaa atg cat get etc teg cac tac aac aag tat ttt aac ggt 480 
Gin Thr Glu Met His Ala Leu Ser His Tyr Asn Lys Tyr Phe Asn Gly 
145 ' 150 155 16 o 

ctg cac aac tec ate cat tgg tac gac egg gtt tgg tat ttg teg gtg 528 
' Leu His Asn Ser He His Trp Tyr Asp Arg Val Trp Tyr Leu Ser Val 

165 170 175 

ccc aag tea ttt ttt gaa gac gcg gcc ace ggt gga ccg ttc gag ttt 57 6 

Pro Lys Ser Phe Phe Glu Asp Ala Ala Thr Gly Gly Pro Phe Glu Phe 

180 185 190 

ctt ace gcg gtg age ttt teg ttc gaa tat gtg ttg ace aac ctg ctg 624 
Leu Thr Ala Val Ser Phe Ser Phe Glu Tyr Val Leu Thr Asn Leu Leu 
195 200 205 

ttt gtc ccc ttc atg teg ggt get get tac aac ggg gac atg tct acg 672 
Phe Val Pro Phe Met Ser Gly Ala Ala Tyr Asn Gly Asp Met Ser Thr 
210 215 220 

lu° " t tCg gCg Caa agt gac gaa tcg C< 3 C cac atg aca 720 

Val Thr Phe Gly Phe Ser Ala Gin Ser Asp Glu Ser Arg His Met Thr 

225 2 30 235 240 

T^n ri* n 9C *** ttC * tg ^ °^ ^ CCg gaC aac ?68 

Leu Gly He Glu Cys He Lys Phe Met Leu Glu Gin Asp Pro Asp Asn 

245 250 255 

gtg ccc ate gtg cag cgc tgg ate gac aag tgg' ttc tgg cgc ggc tat 816 
Val Pro He Val Gin Arg Trp He Asp Lys Trp Phe Trp Arg Gly Tyr 

26 0 265 270 

egg ctg ttg age ate gtg gcc atg atg cag gac tac atg ctg ccc aac 8 64 

Arg Leu Leu Ser He Val Ala Met Met Gin Asp Tyr Met Leu Pro Asn 
275 280 285 

t^i tgg Cg ° gag agC tgg gag atg tac 9 tc gag cag aac 912 

Arg Val Met Ser Trp Arg Glu Ser Trp Glu Met Tyr Val Glu Gin Asn 

29 ° 295 300 

ggc ggc gcg ctg ttc aag gat ctt gcg cgt tat ggc ate cgc aag ccc 960 
Gly Gly Ala Leu Phe Lys Asp Leu Ala Arg Tyr Gly He Arg Lys Pro 
305 310 315 320 

aag ggc tgg gac cag get tgc gaa ggc aag gac cac ate age cat cag 1008 
Lys Gly Trp Asp Gin Ala Cys Glu Gly Lys Asp His He Ser His Gin 

325 330 335 

Th^ Phf 11° tat aaC gCC gCg gCC CCC atc cac acc 105 6 

Thr Phe Ala Val Phe Tyr Asn Tyr Asn Ala Ala Ala Pro He His Thr 

340 345 350 

tgg gtt ccc aca aaa gaa gaa atg gga tgg ctg teg gag aag tac ccc 1104 
Trp Val Pro Thr Lys Glu Glu Met Gly Trp Leu Ser Glu Lys Tyr Pro 
355 36o 365 

^ ^° gaC aag tat tac cgt ccg cgt tg 9 9* c ta <= tgg cgc gag 1152 
Glu Thr Phe Asp Lys Tyr Tyr Arg Pro Arg Trp Asp Tyr Trp Arq Glu 
370 375 ' 380 y 



cag gcc gcc aag ggc aac cgt ttc tac aac aag acg ctg ccg atg etc 
Gin Ala Ala Lys Gly Asn Arg Phe Tyr Asn Lys Thr Leu Pro Met Leu 
385 390 395 . 400 

tgc act acc tgc cag att ccg atg ata ttc acc gag cct ggc gac gca 
Cys Thr Thr Cys Gin He Pro Met He Phe Thr Glu Pro Gly Asp Ala 

405 4io 415 



1200 



1248 



66 



acc aag ate tgc tat cgc gag teg gec tac etc ggc gac aag tat cac 

Thr Lys He Cys Tyr Arg Glu Ser Ala Tyr Leu Gly Asp Lys Tyr His 

420 425 430 

ttc tgc age gac cac tgc aag gag att ttt gac aac gaa ccc gaa aag 

Phe Cys Ser Asp His Cys Lys Glu He Phe Asp Asn Glu Pro Glu Lys 

435 * 440 " 445 

ttc gtg cag tea tgg ctt ccg ccg cag caa gtg tat caa gga aac tgt 

Phe Val Gin Ser Trp Leu Pro Pro Gin Gin Val Tyr Gin Gly Asn Cys 

450 455 460 

ttc aag ccg gat gec gat ccg acc aag gag ggt ttt gat ccc ttg atg 

Phe Lys Pro Asp Ala Asp Pro Thr Lys Glu Gly Phe Asp Pro Leu Met 
465 470 " 475 480' 

gec ttg etc gac tac tac aac ctg aat gta ggc egg gac aac ttc gat 

Ala Leu Leu Asp Tyr Tyr Asn Leu Asn Val Gly Arg Asp Asn Phe Asp 

485 490 495 

ttc gag gga teg gaa gac caa aag aac ttt get gee tgg cgt gga gag 

Phe Glu Gly Ser Glu Asp Gin Lys Asn Phe Ala Ala Trp Arg Gly Glu 

500 505 510 

gtc ttg caa gga gaa gee aaa tga 

Val Leu Gin Gly Glu Ala Lys 
515 



<210> 28 

<211> 519 

<212> PRT 

<213> Burkholderia cepacia 

<400> 28 

Met Asp Thr Ser Val Gin Lys Lys Lys Leu Gly Leu Lys Asn Arg Tyr 
1 5 - • io i5 

Ala Ala Met Thr Arg Gly Leu Gly Trp Gin Thr Ser Tyr Gin Pro Met 

20 ~ 25 30 

Glu Lys Val Phe Pro Tyr Asp Lys Tyr Glu Gly He Lys lie His Asp 
35 40 45 

Trp Asp Lys Trp Glu Asp Pro Phe Arg Leu Thr Met Asp Ala Tyr Trp 
1 50 55 60 

Lys Tyr Gin Gly Glu Lys Glu Lys Lys Leu Tyr Ala Val He Asp Ala 
65 70 75 80 

Phe Ala Gin Asn Asn Gly Gin Leu Ser He Ser Asp Ala Arg Tyr Val 

85 90 95 

Asn Ala Leu Lys Val Phe He Gin Gly Val Thr Pro Leu Glu Tyr Met 

100 105 ' 110 

Ala His Arg Gly Phe Ala His He Gly Arg His Phe Thr Gly Glu Gly 
115 120 125 

Ala Arg Val Ala Cys Gin Met Gin Ser He Asp Glu Leu Arg His Phe 
130 135 140 

Gin Thr Glu Met His Ala Leu Ser His Tyr Asn Lys Tyr Phe Asn Gly 
145 150 155 160 

Leu His Asn Ser He His Trp Tyr Asp Arg Val Trp Tyr Leu Ser Val 

165 170 175 

Pro Lys Ser Phe Phe Glu Asp Ala Ala Thr Gly Gly Pro Phe Glu Phe 

180 185 190 



Leu Thr Ala Val Ser Phe Ser Phe Glu Tyr Val Leu Thr Asn Leu Leu 
195 200 205 
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Phe Val Pro Phe Met Ser Gly Ala Ala Tyr Asn Gly Asp Met Ser Thr 
210 215 220 

Val Thr Phe Gly Phe Ser Ala Gin Ser Asp Glu Ser Arg His Met Thr 
225 230 235 - 240 

Leu Gly lie Glu Cys lie Lys Phe Met Leu Glu Gin Asp Pro Asp Asn 

245 250 255 

Val Pro lie Val Gin Arg Trp He Asp Lys Trp Phe Trp Arg Gly Tyr 

260 265 270 

Arg Leu Leu Ser He Val Ala Met Met Gin Asp Tyr Met Leu Pro Asn 
275 280 285 

Arg Val Met Ser Trp Arg Glu Ser Trp Glu Met Tyr Val Glu Gin Asn 
290 295 300 

Gly Gly Ala Leu Phe Lys Asp Leu Ala Arg Tyr Gly He Arg Lys Pro 
305 310 315 " 320 

Lys Gly Trp Asp Gin Ala Cys Glu Gly Lys Asp His He Ser His Gin 

325 330 335 

Thr Phe Ala Val Phe Tyr Asn Tyr Asn Ala Ala Ala Pro He His Thr 

340 345 350 

Trp Val Pro Thr Lys Glu Glu Met Gly Trp Leu Ser Glu Lys Tyr Pro 
355 360 365 

Glu Thr Phe Asp Lys Tyr Tyr Arg Pro Arg Trp Asp Tyr Trp Arg Glu 
370 375 380 

Gin Ala Ala Lys Gly Asn Arg Phe Tyr Asn Lys Thr Leu Pro Met Leu 
385 390 395 400 

Cys Thr Thr Cys Gin He Pro Met He Phe Thr Glu Pro Gly Asp Ala 

405 410 415 

Thr Lys He Cys Tyr Arg Glu Ser Ala Tyr Leu Gly Asp Lys Tyr His 

420 425 ^ 430 

Phe Cys Ser Asp His Cys Lys Glu He Phe Asp Asn Glu Pro Glu Lys 
435 440 445 

Phe Val Gin Ser Trp Leu Pro Pro Gin Gin Val Tyr Gin Gly Asn Cys 
450 455 460 

Phe Lys Pro Asp Ala Asp Pro Thr Lys Glu Gly Phe Asp Pro Leu Met 
465 470 475 480 

Ala Leu Leu Asp Tyr Tyr Asn Leu Asn Val Gly Arg Asp Asn Phe Asp 

485 490 495 

Phe Glu Gly Ser Glu Asp Gin Lys Asn Phe Ala Ala Trp Arg Gly Glu 

500 505 510 

Val Leu Gin Gly Glu Ala Lys 
515 



<210> 29 

<211> 996 

<212> DNA 

<213> Burkholderia cepacia 



<220> 

<221> CDS 

<222> (1) . . (996) 

<400> 29 

atg acc ate gat ttg aag acg egg gaa ate aaa cca ctg cgt cac acc 
Met Thr He Asp Leu Lys Thr Arg Glu He Lys Pro Leu Arg His Thr 
1 5 io 15 



68 



tac acg cac gtg get caa tac ate ggg gec gat aaa gec get teg cgc 96 

Tyr Thr His Val Ala Gin Tyr lie Gly Ala Asp Lys Ala Ala Ser Arg 

20 ~ 25 30 

tat cag gaa ggc act gta ggt get caa ccc gca gcg aat ttt cat tac 144 

Tyr Gin Glu Gly Thr Val Gly Ala Gin Pro Ala Ala Asn Phe His Tyr 

35 40 45 

egg ccc acg tgg gat ccc gag cat gaa ctg ttc gac acg teg cgt acc 192 

Arg Pro Thr Trp Asp Pro Glu His Glu Leu Phe Asp Thr Ser Arg Thr 

5Q 55 60 

gcg att caa atg aag gac tgg tat gcg ctg aaa gac ccg cgt cag ttc 240 

Ala lie Gin Met Lys Asp Trp Tyr Ala Leu Lys Asp Pro Arg Gin Phe 

65 70 75 80 

tac tac gcg teg tgg. acg atg acc cga gcg egg cag caa gac gcg atg 288 

Tyr Tyr Ala Ser Trp Thr Met Thr Arg Ala Arg Gin Gin Asp Ala Met 

85 90 95 

gaa tec aac ttc gag ttt gtc gag teg cgc ggc atg ate gat etc gtt 336 

Glu Ser Asn Phe Glu Phe Val Glu Ser Arg Gly Met lie Asp Leu Val 

100 105 110 

tec gat gag gtt cga caa egg gcg ctt tec gtt etc gtg cct ttg cgt 384 

Ser Asp Glu Val Arg Gin Arg Ala Leu Ser Val Leu Val Pro Leu Arg 

115 120 125 

cac gcg gee tgg ggc gcg aac atg aac aac tec cag ate tgt gec eta 432 

His Ala Ala Trp Gly Ala Asn Met Asn Asn Ser Gin lie Cys Ala Leu 

130 135 140 

ggt tat ggc acg acc ttc act gcg ccg get atg ttc cac gca atg gac 480 

Gly Tyr Gly Thr Thr Phe Thr Ala Pro Ala Met Phe His Ala Met Asp 

145 150 155 160 

aat ctg ggt gta gcg cag tat etc aca cga ctg gcg ctg gta atg tct 528 

Asn Leu Gly Val Ala Gin Tyr Leu Thr Arg Leu Ala Leu Val Met Ser 

165 170 175 

gga ccc gat ctt ctt gac gaa gee aag caa gec tgg atg acg agt ccc 576 

Gly Pro Asp Leu Leu Asp Glu Ala Lys Gin Ala Trp Met Thr Ser Pro 

180 185 190 

gat tgg caa ccg ttg cgt cgt tat gtg gaa aac act ctg gtg ctg caa 624 

Asp Trp Gin Pro Leu Arg Arg Tyr Val Glu Asn Thr Leu Val Leu Gin 

195 200 205 

gat ccg gtg gaa ctg ttc ate gec caa aat ctg gcg etc gac ggt ctt 672 

Asp Pro Val Glu Leu Phe lie Ala Gin Asn Leu Ala Leu Asp Gly Leu 

210 215 220 

ctt tat ccc atg ate tac ggc get ttc gtc gac gat tac ate gca etc 720 

Leu Tyr Pro Met lie Tyr Gly Ala Phe Val Asp Asp Tyr lie Ala Leu 

225 230 235 240 

aac ggt ggt age gca gtg gca atg eta acc act ttc atg ccc gag tgg 7 68 

Asn Gly Gly Ser Ala Val Ala Met Leu Thr Thr Phe Met Pro Glu Trp 

245 250 255 

cat gac gaa tec agt cgc tgg gtc gat gcg gta gta aag acc atg gcg 816 

His Asp Glu Ser Ser Arg Trp Val Asp Ala Val Val Lys Thr Met Ala 

260 265 . 270 

acg gaa teg gag gat aac aaa gcg ctg etc att cac tgg ttg cgt acc 8 64 

Thr Glu Ser Glu Asp Asn Lys Ala Leu Leu lie His Trp Leu Arg Thr 

275 280 285 

tgg gaa gat cag gcg gcg tea gcg ttg ttg cct gtc get gaa atg get 912 

Trp Glu Asp Gin Ala Ala Ser Ala Leu Leu Pro Val Ala Glu Met Ala 

290 295 300 

ttg gcg gaa aac ggc cac gac gec ttg gaa gaa gta agg cag caa ctt 9 60 

Leu Ala Glu Asn Gly His Asp Ala Leu Glu Glu Val Arg Gin Gin Leu 

305 310 315 320 
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cgt gcc cgc gtt gcg aag gcc ggg att gtt ctg taa 
Arg Ala Arg Val Ala Lys Ala Gly He Val Leu 

325 330 

<210> 30 
<211> 331 
<212> PRT 

<213> Burkholderia cepacia 
<400> 30 

Met Thr He Asp Leu Lys Thr Arg Glu He Lys Pro Leu Arg His Thr 
1 5 10 15 

Tyr Thr His Val Ala Gin Tyr He Gly Ala Asp Lys Ala Ala Ser Arg 
' 20 25 30 

Tyr Gin Glu. Gly Thr Val Gly Ala Gin Pro Ala Ala Asn Phe His Tyr 
35 40 45 

Arg Pro Thr Trp Asp Pro Glu His Glu Leu Phe Asp Thr Ser Arg Thr 
50 55 60 

Ala He Gin Met Lys Asp Trp Tyr Ala Leu Lys Asp Pro Arg Gin Phe 
65 70 75 80 

Tyr Tyr Ala Ser Trp Thr Met Thr Arg Ala Arg Gin Gin Asp Ala Met 

85 90 95 

Glu Ser Asn Phe Glu Phe Val Glu Ser Arg Gly Met He Asp Leu Val 

100 105 no 

Ser Asp Glu Val Arg Gin Arg Ala Leu Ser Val Leu Val Pro Leu Arg 
H5 120 125 

His Ala Ala Trp Gly Ala Asn Met Asn Asn Ser Gin He Cys Ala Leu 
130 135 140 

Gly Tyr Gly Thr Thr Phe Thr Ala Pro Ala Met Phe His Ala Met Asp 
145 150 155 160 

Asn Leu Gly Val Ala Gin Tyr Leu Thr Arg Leu Ala Leu Val Met Ser 

165 ■ .170 • 175 

Gly Pro Asp Leu Leu Asp Glu Ala Lys Gin Ala Trp Met Thr Ser Pro 

180 185 ~ 190 

Asp Trp Gin Pro Leu Arg Arg Tyr Val Glu Asn Thr Leu Val Leu Gin 
195 200 205 

Asp Pro Val Glu Leu Phe He Ala Gin Asn Leu Ala Leu Asp Gly Leu 
210 215 220 

Leu Tyr Pro Met lie Tyr Gly Ala Phe Val Asp Asp Tyr He Ala Leu 
225 230 235 240 

Asn Gly Gly Ser Ala Val Ala Met Leu Thr Thr Phe Met Pro Glu Trp 

245 250 255 

His Asp Glu Ser Ser Arg Trp Val Asp Ala Val Val Lys Thr Met Ala 

260 265 270 

Thr Glu Ser Glu Asp Asn Lys Ala Leu Leu He His Trp Leu Arg Thr 
275 280 285 

Trp Glu Asp Gin Ala Ala Ser Ala Leu Leu Pro Val Ala Glu Met Ala 
290 295 300 

Leu Ala Glu Asn Gly His Asp Ala Leu Glu Glu Val Arg Gin Gin Leu 
305 310 315 320 

Arg Ala Arg Val Ala Lys Ala Gly He Val Leu 

3 25 330 



996 
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<210> 31 

<211> 357 

<212> DNA 

<213> Burkholderia cepacia 



<220> 

<221> CDS 

<222> (I).. (357) 



< !°° > m-f arc etc aaa ccc tac aag ttc ccg gca cga gac gcg 

£l tlr fax 2S Leu £ Pro Tyr Lyl Phe Pro ,1a Arg Asp Ala 

1 5 

= rca aca cca ttg ctg ttt ate ggc tgg- gaa gac cat 

S S" A- Phe Pro All Pro J Leu Phe He Gly Trp Glu Asp H3.S 

2 0 25 

ctg ttg ttt gcg gca cct gtt gee ttg ccc ctg ccg teg gac acg ttg 
Leu Leu Phe Ala Ala Pro Val Ala Leu Pro Leu Pro Ser Asp 

40 " 



35 



ttc ggt gcg ctg tgc acc cag gtg ttg ccc ggc act tat ggc tat cac 
Se GXy Ala Leu Cys Thr Gin Val Leu Pro Gly Thr Tyr Gly Tyr His 



50 



ccc gat ttc tea aag ate gac tgg age cag gtg cag tgg ttt aag tec 
Pro Asp Phe Ser Lys He Asp Trp Ser Gin Val Gin Trp Phe Lys ^ 



65 



70 75 



ggc cag ccg tgg cat ccc gac ccg gcg aag teg ctg get gaa aac ggt 
Gly Gin Pro Trp His Pro Asp Pro Ala Lys Ser Leu Ala Glu Asn Gly 

85 . 90 

ctg acg cac aaa gac gtg ate egc ttt cgc acg cct ggc ttg aac ggt 
Leu Thr His Lys Asp Val lie Arg Phe Arg inr rz.^ ^ ^ 



100 1° 5 

ctg age ggt tec tgc aat tga 
Leu Ser Gly Ser Cys Asn 
115 



<210> 32 

<211> 118 

<212> PRT 

<213> Burkholderia cepacia 

<400> 32 

Met Ser Val Val Ala Leu Lys Pro Tyr Lys Phe Pro Ala Arg Asp Ala 
1 * 10 

™ rci u Asn Phe Pro Ala Pro Leu 

20 25 



Arg Glu Asn Phe Pro Ala Pro Leu Leu Phe He Gly Trp Glu Asp His 



Leu Leu Phe Ala Ala Pro Val Ala Leu Pro Leu Pro Ser Asp Thr Leu 

35 4° 45 

Phe Gly Ala Leu Cys Thr Gin Val Leu Pro Gly Thr Tyr Gly Tyr His 

50 • 

Pro Asp Phe Ser Lys He Asp Trp Ser Gin Val Gin Trp Phe Lys Ser 

65 70 75 

Gly Gin Pro Trp His Pro Asp Pro Ala Lys Ser Leu Ala Glu Asn Gly 



85 



Leu Thr His Lys Asp Val He Arg Phe Arg Thr Pro Gly Leu Asn Gly 

100 105 

Leu Ser Gly Ser Cys Asn 
115 



96 



144 



192 



240 



288 



336 



357 
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<210> 33 

<211> 1143 

<212> DNA 

<213> Bacillus stearothermophilus 

<220> 

<221> CDS 

<222> (1)..(1143) 

<400> 33 

atg gaa aaa aat aaa atg tta ata gaa gaa aag ttg gac act get get 

Met Glu Lys Asn Lys Met Leu lie Glu Glu Lys Leu Asp Thr Ala Ala 
1 5 10 is 

ctt ctt get aag gcg gag gaa ata ggc egg att get gag gaa gag gcg 
Leu Leu Ala Lys Ala Glu Glu He Gly Arg He Ala Glu Glu Glu Ala 

20 25 30 

» 

ggt gaa gcg gac cgc aat gee tgt ttc tec gac egg gtg get agg gee 
Gly Glu Ala Asp Arg Asn Ala Cys Phe Ser Asp Arg Val Ala Arg Ala 
35 40 45 

att aaa gaa get gga ttc cac aag etc- atg cgt ccc aag cag tac gga 
He Lys Glu Ala Gly Phe His Lys Leu Met Arg Pro Lys Gin Tyr Gly 
50 55 6Q 

gga ctg caa gta gac ttg cga act tac ggg gag att gtc cgc aca gtg 
Gly Leu Gin Val Asp Leu Arg Thr Tyr Gly Glu He Val Arg Thr Val 
65 70 75 Q0 

gee egg tac agt gtt gee gca gga tgg ctg ace tat ttt tat tec atg 
Ala Arg Tyr Ser Val Ala Ala Gly Trp Leu Thr Tyr Phe Tyr Ser Met 

85 go * 95 

S?* ?? g ff, ^ gCa tat Ctg CCt cca aaa aga gaa gaa att 

His Glu Val Trp Ala Ala Tyr Leu Pro Pro Lys Gly Arg Glu Glu He 

100 105 no 

ttt gga caa gga ggg ctg ttg gca gac gtc gtt gee cct gtt ggc egg 
Phe Gly Gin Gly Gly Leu Leu Ala Asp Val Val Ala Pro Val Gly Arg 
H5 120 125 

gtg gag aag gac ggg gac ggc tac cgt etc tat ggg cag tgg aac ttc 
Val Glu Lys Asp Gly Asp Gly Tyr Arg Leu Tyr Gly Gin Trp Asn Phe 
130 135 140 

tgt age ggt gtc etc cat agt gac tgg ate gga ctt ggc gee atg atg 
Cys Ser Gly Val Leu His Ser Asp Trp He Gly Leu Gly Ala Met Met 
145 150 155 i 6 o 

gag ctg cct gac ggc aat agt cct gag tac tgt ttg tta gtg ctg cct 
Glu Leu Pro Asp Gly Asn Ser Pro Glu Tyr Cys Leu Leu Val Leu Pro 

165 • ivo 175 

aag teg gat gtc cag ate gta gaa aat tgg gat acc atg ggc etc cgc 
Lys Ser Asp Val Gin He Val Glu Asn Trp Asp Thr Met Gly Leu Arg 

180 185 190 

get teg gga age aac ggg gta tta gtt gaa ggt get tat gtt cca tta 
Ala Ser Gly Ser Asn Gly Val Leu Val Glu Gly Ala Tyr Val Pro Leu 
195 200 205 

cac egg ate ttt ccg get ggc egg gtg atg get cat ccg ctt ttc ttg 

His Arg He Phe Pro Ala Gly Arg Val Met Ala His Pro Leu Phe Leu 

210 215 220 

« 

ctt ggg ttc cct tta gta tct tta ggc ggc gac gaa cga ttg gtg tea 
Leu Gly Phe Pro Leu Val Ser Leu Gly Gly Asp Glu Arg Leu Val Ser 
225 230 235 240 

ctt ttc caa gaa cgc act gag aag cgc att cgt gtc ttc aaa ggc ggc 
Leu Phe Gin Glu Arg Thr Glu Lys Arg He Arg Val Phe Lys Gly Gly 

245 250 " 255 

gcg aaa gaa aag gat tct gee get age cag egg ctg tta gee gag atg 



72 



Ala Lys Glu Lys Asp Ser Ala Ala Ser Gin Arg Leu Leu Ala Glu Met 

260 265 270 

aaa aca gaa tta aat gca atg gaa ggc att gtg gaa caa tat ate cgc 8 64 

Lys Thr Glu Leu Asn Ala Met Glu Gly lie Val Glu Gin Tyr lie Arg 

275 280 " ' 285 

cag ctt gag get tgc caa aaa gaa gga aag acg gtg atg aac gat atg 912 

Gin Leu Glu Ala Cys Gin Lys Glu Gly Lys Thr Val Met Asn Asp Met 

290 295 300 

gag cga gag cag eta ttc gca tgg cgt gga tat gtg gca aaa gcg tec 960 

Glu Arg Glu Gin Leu Phe Ala Trp Arg Gly Tyr Val Ala Lys Ala Ser 

305 310 ~ 315 320 

gee aat att gec gtc aga aca ctg tta act ctt gga ggc aat teg ate 1008 

Ala Asn lie Ala Val Arg Thr Leu Leu Thr Leu Gly Gly Asn Ser lie 

325 330 335 

ttt aaa ggc gat ccg gta gaa ctg ttc aca aga gat ttg eta gcg gtg 1056 

Phe Lys Gly Asp Pro Val Glu Leu Phe Thr Arg Asp Leu Leu Ala Val 

340 345 350 

gee gca cat cct aac tec ctg tgg gag gat gcg atg get gca tat gga 1104 

Ala Ala His Pro Asn Ser Leu Trp Glu Asp Ala Met Ala Ala Tyr Gly 

355 360 ' 365 

aga acg ata ttc ggg ctg cca ggg gac cca gtc tgg taa 1143 

Arg Thr lie Phe Gly Leu Pro Gly Asp Pro Val Trp 

370 375 380 



<210> 34 

<211> 380 

<212> PRT 

<213> Bacillus stearothermophilus 

<400> 34 

Met Glu Lys Asn Lys Met Leu lie Glu Glu Lys Leu Asp Thr Ala Ala 
1 5 • 10 15 

Leu Leu Ala Lys Ala Glu Glu lie Gly Arg lie Ala Glu Glu Glu Ala 

20 25 30 

Gly Glu Ala Asp Arg Asn Ala Cys Phe Ser Asp Arg Val Ala Arg Ala 
35 40 45 

lie Lys Glu Ala Gly Phe His Lys Leu Met Arg Pro Lys Gin Tyr Gly 
50 55 60 

-Gly Leu Gin Val Asp Leu Arg Thr Tyr Gly Glu lie Val Arg Thr Val 
65 70 75 80 

Ala Arg Tyr Ser Val Ala Ala Gly Trp Leu Thr Tyr Phe Tyr Ser Met 

85 90 95 

His Glu Val Trp Ala Ala Tyr Leu Pro Pro Lys Gly Arg Glu Glu lie 

100 105 110 

Phe Gly Gin Gly Gly Leu Leu Ala Asp Val Val Ala Pro Val Gly Arg 
115 120 125 

Val Glu Lys Asp Gly Asp Gly Tyr Arg Leu Tyr ' Gly Gin Trp Asn Phe 
130 " 135 140 

Cys Ser Gly Val Leu His Ser Asp Trp lie Gly Leu Gly Ala Met Met 
145 150 155 160 

Glu Leu Pro Asp Gly Asn Ser Pro Glu Tyr Cys Leu Leu Val Leu Pro 

165 170 175 

Lys Ser Asp Val Gin lie Val Glu Asn Trp Asp Thr Met Gly Leu Arg 

180 185 190 

Ala Ser Gly Ser Asn Gly Val Leu Val Glu Gly Ala Tyr Val Pro Leu 
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195 200 



205 



His Arg lie Phe Pro Ala Gly Arg Val Met Ala His Pro Leu Phe Leu 
210 215 220 

Leu Gly Phe Pro Leu Val Ser Leu Gly Gly Asp Glu Arg Leu Val Ser 

230 235 240 

Leu Phe Gin Glu Arg Thr Glu Lys Arg He Arg Val Phe Lys Gly Gly 

245 250 255 

Ala Lys Glu Lys Asp Ser Ala Ala Ser Gin Arg Leu Leu Ala Glu Met 

260 265 270 

Lys Thr Glu Leu Asn Ala Met Glu Gly lie Val Glu Gin Tyr He Arg 
275 280 285 

Gin Leu Glu Ala Cys Gin Lys Glu Gly Lys Thr Val Met Asn Asp Met 
290 295 300 

Glu Arg Glu Gin Leu Phe Ala Trp Arg Gly Tyr Val Ala Lys Ala Ser 
305 ' 310 315 320 

Ala Asn He Ala Val Arg Thr Leu Leu Thr Leu Gly Gly Asn Ser He 

325 330 335 

Phe Lys Gly Asp Pro Val Glu Leu Phe Thr Arg Asp Leu Leu Ala Val 

340 345 350 

Ala Ala His Pro Asn Ser Leu Trp Glu Asp Ala Met Ala Ala Tyr Gly 
355 360 36 ^ * * 

Arg Thr He Phe Gly Leu Pro Gly Asp Pro Val Trp 
370 375 380 

<210> 35 

<211> 1191 

<212> DNA 

<213> Helianthus annuus 

<220> 

<221> CDS 

<222> (1) . . (1191) 

<400> 35 " 

atg gcg att cgc ate aat acg gcg acg ttt caa tea gac ctg tac cgt 
Met Ala He Arg He Asn Thr Ala Thr Phe Gin Ser Asp Leu Tyr Arg 
15 10 15 



c »k J CCt ° aa CCg aaa cct ctc a ^ a tct ccc ttc gec 

Ser Phe Ala Phe Pro Gin Pro Lys Pro Leu Arg Ser Pro Lys Phe Ala 

20 25 30 



48 



96 



144 



m ? ?? o acc att gga tec get aca acg aag gtt gaa age ace aaa 
Met Ala Ser Thr He Gly Ser Ala Thr Thr Lys Val Glu Ser Thr Lys 
35 40 45 

aag ccc ttt acc cct cca agg gag gtt cac caa cag gtg eta cac tea 192 
Lys Pro Phe Thr Pro Pro Arg Glu Val His Gin Gin Val Leu His Ser 
50 55 60 

atg ccg cca caa aag ate gaa ate ttc aaa tec atg gag ggt tgg gee 240 
Met Pro Pro Gin Lys He Glu He Phe Lys Ser Me? Glu Gly T ip La 
55 70 75 80 

gaa aat aac ata ttg gtt cac eta aag cct gtc gaa aaa tgc tgg caa 288 
Glu Asn Asn He Leu Val His Leu Lys Pro Val Glu Lys Cys Trp Gin 

85 90 " 95 

gca cag gat ttc eta cca gat ccc gca tct gac gga ttt atg gaa caa 336 
Ala Gin Asp Phe Leu Pro Asp Pro Ala Ser Asp Gly Phe Met Glu Gin 

100 105 no 

gtg gag gaa tta egg get egg get aag gag att ccg gat gat tac ttt 384 



v 



vr, a 
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Val Glu Glu Leu Arg Ala Arg Ala Lys Glu lie Pro Asp Asp Tyr Phe 
115 - 120 125 

att att ttg gtt gga gat atg att act gaa gaa gca ctg cct act tac 
Val vS Leu Val Gly Asp Met He Thr Glu Glu Ala Leu Pro Thr Tyr 
130 135 140 

caa aca atg ctt aat act ctt gat ggt gtg cgt gat gag acc ggg get 
Gin Thr Met Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr Gly Ala 
145 150 i55 X0U 

acc eta ctt ctt ggg eta gtc tgg act egg get tgg acc get gaa gaa 
Thr Leu Leu Leu Gly Leu Val Trp Thr Arg Ala Trp Thr Ala Glu Glu 

165 I*? 0 1/b 

aac acrcr cac ggt gat ctt eta cat .cag tat ctg tat ctt agt ggg egg 
Tsn Arg His Gly Asp Leu Leu His Gin Tyr Leu Tyr Leu Ser Gly Arg 

180 185 I 90 

gtc gac atg agg cag att cag aag aca att cag tac etc att ggg tct 
Val Asp Met Arg Gin He Gin Lys Thr He Gin Tyr Leu He Gly Ser 
195 200 . 205 

gga atg gac ccc egg acc gaa aac agt cct tac ctt ggg ttc ate tac 
Gly Met Asp Pro Arg Thr Glu Asn Ser Pro Tyr Leu Gly Phe He Tyr 

215 220 



210 



anf tea ttt caa gag cgt gec acc ttc ate tct cac gga aac aca gee 
Thr Ser Phe Gin 111 Arg Ala Thr Phe He Ser His Gly Asn Thr Ala 
225 230 235 



cac cac gca aag gag cat ggt gac gtg aag ctg get caa atg tgc ggt 
Arg His Ala Lys Glu His Gly Asp Val Lys Leu Ala Gin Met. Cys Gly 

245 " 250 255 



ata att aca get gat gaa aaa agg cac gaa acc gec tac aca aaa at a 
fie He HI Ala Asp Glu Lys Arg His Glu Thr Ala Tyr Thr Lys He 

260 265 270 

gta gaa aaa etc ttc gaa att gac ccg gac ggc act gtt etc get ttt 
Val Glu Lys Leu Phe Glu He Asp Pro Asp Gly Thr Val Leu Ala Phe 
275 280 285 

acc aac atg atg agg aaa aag ate tec atg cct gca cac ttg atg tac 
lit Asp Me? Me? A?g Lys Lys He Ser Met Pro Ala His Leu Met Tyr 
290 295 300 

gat ggg cgt gat gat aac etc ttc gaa aat ttc tea get gtt gec caa 
Cp G?y A?g Asp Asp Asn Leu Phe Glu Asn Phe Ser Ala Val Ala Gin 
305 310 315 



370 



375 380 



ccg ttc age tgg ate ttt gat aga gaa gtg aag etc tga 
Pro Phe Ser Trp He Phe Asp Arg Glu Val Lys Leu 
385 390 395 



<210> 36 

<211> 396 

<212> PRT 

<213> Helianthus annuus 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



aaa etc ggt gtg tac act gcg aag gac tat gca gac att ctg gag ttt 
111 Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp He Leu Glu Phe 

325 330 335 

ctg gtg ggc egg tgg aag gtg gcg gat tta acc ggg ctt tct ggt gaa 1056 
Leu val lly A?g T?p Lys Val Ala Asp Leu Thr Gly Leu Ser Gly Glu 

340 345 J:3U 

ggg cgt aaa gec caa gac tat gtg tgc ggg ctg gec eca aga ate aga 
Gly A?g Lys Ala Gin Asp Tyr Val Cys Gly Leu Ala Pro Arg He Arg 
355 360 365 

agg ctt gag gag agg aac teg gca agg gcg aag gaa agt gtg aac gtt 
Arg Leu Glu Glu Arg Asn Ser Ala Arg Ala Lys Glu Ser Val Asn Val 



1104 



1152 



1191 
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<400> 36 

Met Ala lie Arg lie Asn Thr Ala Thr Phe Gin Ser Asp Leu Tyr Arg 
15 10 15 

Ser Phe Ala Phe Pro Gin Pro Lys Pro Leu Arg Ser Pro Lys Phe Ala 

20 25 30 

Met Ala Ser Thr lie Gly Ser Ala Thr Thr Lys Val Glu Ser Thr Lys 
35 40 45 

Lys Pro Phe Thr Pro Pro Arg Glu Val His Gin Gin Val Leu His Ser 
50 55 60 

Met Pro Pro Gin Lys He Glu He Phe Lys Ser Met Glu Gly Trp Ala 
65 70 75 " * 80 

Glu Asn Asn He Leu Val His Leu Lys Pro Val Glu Lys Cys Trp Gin 

85 90 95 

Ala Gin Asp Phe Leu Pro Asp Pro Ala Ser Asp Gly Phe Met Glu Gin 

100 105 110 

Val Glu Glu Leu Arg Ala Arg Ala Lys Glu He Pro Asp Asp Tyr Phe 
115 120 125 

Val Val Leu Val Gly Asp Met He Thr Glu GlU Ala Leu Pro Thr Tyr 
130 135 140 

Gin Thr Met Leu Asn Thr Leu Asp Gly Val Arg Asp Glu Thr Gly Ala 
145 150 155 160 

Thr Leu Leu Leu Gly Leu Val Trp Thr Arg Ala Trp Thr Ala Glu Glu 

165 170 175 

Asn Arg His Gly Asp Leu Leu His Gin Tyr Leu Tyr Leu Ser Gly Arg 

180 185 190 

Val Asp Met Arg Gin He Gin Lys Thr He Gin Tyr Leu He Gly Ser 
195 200 205 

Gly Met Asp Pro Arg Thr Glu Asn Ser Pro Tyr Leu Gly Phe He Tyr 
210 215 220 

Thr Ser Phe Gin Glu Arg Ala Thr Phe He Ser His Gly Asn Thr Ala 
225 230 235 240 

Arg His Ala Lys Glu His Gly Asp Val Lys Leu Ala Gin Met Cys Gly 

245 250 255 

He He Ala Ala Asp Glu Lys Arg His Glu Thr Ala Tyr Thr Lys He 

260 265 270 

Val Glu Lys Leu Phe Glu He Asp Pro Asp Gly Thr Val Leu Ala Phe 
275 280 285 

Ala Asp Met Met Arg Lys Lys He Ser Met Pro Ala His Leu Met Tyr 
290 295 300 

Asp Gly Arg Asp Asp Asn Leu Phe Glu Asn Phe Ser Ala Val Ala Gin 
305 310 315 320 

Arg Leu Gly Val Tyr Thr Ala Lys Asp Tyr Ala Asp He Leu Glu Phe 

■ 325 330 335 

Leu Val Gly Arg Trp 'Lys Val Ala Asp Leu Thr Gly Leu Ser Gly Glu 

340 345 350 

Gly Arg Lys Ala Gin Asp Tyr Val Cys Gly Leu Ala Pro Arg He Arg 
355 360 365 

Arg Leu Glu Glu Arg Asn Ser Ala Arg Ala Lys Glu Ser Val Asn Val 
370 375 380 

Pro Phe Ser Trp He Phe Asp Arg Glu Val Lys Leu 
385 390 395 
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CLAIMS 

1 . A method of carrying out an oxidation reaction catalysed by a 
monooxygenase enzyme and using hydrogen peroxide as an oxidant, in which 
reaction a low level of oxidation damage of the monooxygenase occurs, said method 
comprising producing the hydrogen peroxide simultaneously with the oxidation 
reaction, wherein the hydrogen peroxide is produced at a rate less than or equal to the 
rate at which it is used in the reaction. 

2. A method according to claim 1 , wherein the monooxygenase 
enzyme has a K m for H 2 0 2 of at least 1 5nM. 

3 . A method according to claim 1 or 2, wherein the monooxygenase 
enzyme is a P450 enzyme. 

4. A method according to any one of the preceding claims, wherein 
the rate of H 2 0 2 production is less than or equal to 3 \xg per mg of enzyme. 

* 

5. A method according to any one of the preceding claims, 
wherein the concentration of H 2 0 2 throughout the reaction is less than or equal to 1 
mM. 

6. A method according to any one of the preceding claims, wherein 
the reaction continues for at least 240 minutes. 

7. A method according to any one of the preceding claims, wherein 
the H 2 Q 2 is produced by an electrochemical reaction. 

8. A method according to any one of claims 1 to 6, wherein the H 2 0 2 
is produced by an enzyme reaction. 
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9. 



oxidage A meth ° d aCC ° rdin g t0 claim 8 > wherein the enzyme is glucose 



10. A method according to any one of claims 1 to 6, wherein the H 2 Q 2 is 
5 produced by a H 2 0 2 precursor. 



11. A method according to claim 1 0, wherein the H 2 Q 2 precursor is 
perborate, percarbonate or perphosphate. 

fh u j 2 ' A meth ° d aCC ° rding t0 ™y one of Preceding claims, wherein 
the substrate which is oxidi^H h« tho ™ 

compound t ! ^oxygenase enzyme is an alkane, aromatic 

compound, terpenoid compound, alkene or fatty acid. 



13. 



« as denned in caj^ " ^ * **" » — 



1 4. Use of an enzyme for producing H.O, to drive an oxidation reaction 
as defined in claim 8 or 9. 



0 15. 



USS ° f Perb ° rate ' P^onate or perphosphate for producing H 2 Q 2 
to drive an oxidation reaction as defined in claim 10. 



16. 



A method of carrying out an oxidation reaction catalysed by a 
^oxygenase enzyme and using hydrogen peroxide as an oxidant, in which 
reaction a low level of oxidation damage of the monooxygenase occurs, said method 
comprising carrying out the reaction in the presence of an H 2 G 2 or hydroxyl radical 
sequestering agent that controls the H 2 0 2 or hydroxyl radical concentration 



17. 



EDTA. 



A method according to claim 16, wherein the sequestering agent is 



